(57) Primers for synthesizing full-length cDNAs and their use are provided.

830 cDNAs encoding human proteins have been isolated, and the nucleotide sequences of the 5'- and 3'-ends of
the cDNAs have been determined. Furthermore, primers for synthesizing the full-length cDNAs have been provided
to clarify the functions of the proteins encoded by the cDNAs. The full-length cDNAs of the present invention,
which contain the translation start site, provide information useful for analyzing the functions of the proteins.
Description

FIELD OF THE INVENTION 

[0001] The present invention relates to a polynucleotide encoding a novel protein, a protein encoded by the
polynucleotide, and new uses of these.

BACKGROUND OF THE INVENTION 

[0002] Sequencing projects to determine and analyze the genomic DNA of various living organisms are currently in
progress all over the world. The whole genomic sequences of more than 10 species of prokaryotes, a lower eukaryote
(yeast), and a multicellular eukaryote (C. elegans) have already been determined. As for the human genome, which is
estimated to comprise three thousand million base pairs, worldwide cooperative projects are under way to analyze
it, and the whole structure is predicted to be determined by 2002-2003. The aim of determining genomic sequences
is to reveal the functions of all genes and their regulation, and to understand living organisms as a network of
interactions between genes, proteins, cells, or individuals by deducing the information in a genome, which is a
blueprint of the highly complicated living organism. Understanding living organisms by utilizing the genomic
information from various species is not only important as an academic subject, but also socially significant from
the viewpoint of industrial application.

[0003] However, determination of genomic sequences by itself cannot identify the functions of all genes. For
example, in yeast, the function of only approximately half of the 6000 genes predicted from the genomic sequence
could be deduced. In humans, the number of genes is predicted to be approximately one hundred thousand. Therefore,
it is desirable to establish "a high throughput analysis system of the gene functions" which allows rapid and
efficient identification of the functions of the vast number of genes obtained by genomic sequencing.

[0004] Many genes in the eukaryotic genome are split by introns into multiple exons. Thus, it is difficult to
correctly predict the structure of an encoded protein based solely on genomic information. In contrast, cDNA, which
is produced from mRNA that lacks introns, encodes a protein as a single continuous amino acid sequence and allows
the primary structure of the protein to be identified easily. In human cDNA research, more than one million ESTs
(Expressed Sequence Tags) are publicly available to date, and the ESTs presumably cover not less than 80% of all
human genes.

[0005] The information from ESTs is utilized for analyzing the structure of the human genome, or for predicting the
exon regions of genomic sequences or their expression profiles. However, many human ESTs have been derived from
regions proximal to the 3'-end of cDNA, and very little information is available around the 5'-end of mRNA. Among
these human cDNAs, the number of corresponding mRNAs whose encoded protein sequences have been deduced is
approximately 7000, and of these only 5500 are full-length. Thus, even including cDNA registered as ESTs, the
percentage of human cDNA obtained so far is estimated to be 10-15% of all the genes.

[0006] It is possible to identify the transcription start site of mRNA on the genomic sequence based on the 5'-end
sequence of a full-length cDNA, and to analyze factors involved in the stability of the mRNA contained in the cDNA,
or in the regulation of its expression at the translation stage. Also, since a full-length cDNA contains ATG, the
translation start site, in the 5'-region, it can be translated into a protein in the correct frame. Therefore, it
is possible to produce a large amount of the protein encoded by the cDNA or to analyze the biological activity of
the expressed protein by utilizing an appropriate expression system. Thus, analysis of a full-length cDNA provides
valuable information which complements the information from genome sequencing. Also, full-length cDNA clones that
can be expressed are extremely valuable in empirical analysis of gene function and in industrial application.

[0007] In particular, human secretory proteins or membrane proteins would be useful by themselves as medicines,
like tissue plasminogen activator (TPA), or as targets of medicines, like membrane receptors. In addition, genes
for signal transduction-associated proteins (protein kinases, etc.), glycoprotein-associated proteins,
transcription-associated proteins, and disease-associated proteins form a gene group rich in genes whose
relationships to human diseases have been elucidated.

[0008] Therefore, it is of great significance to isolate novel human full-length cDNA clones, only a few of which
have been isolated. In particular, isolation of a novel cDNA clone encoding a secretory protein or membrane protein
is desired, since the protein itself would be useful as a medicine, and since the clones potentially include a gene
associated with diseases. In addition, genes encoding proteins that are associated with signal transduction,
glycoprotein, transcription, or diseases are expected to be useful as target molecules for therapy, or as medicines
themselves. These genes form a gene group predicted to be strongly associated with diseases. Thus, identification
of the full-length cDNA clones encoding those proteins has great significance.

SUMMARY OF THE INVENTION

[0009] An objective of the present invention is to provide a primer that enables the synthesis of a human
polynucleotide, the resulting polynucleotide or its clone, and a protein encoded by the polynucleotide.
[0010] The inventors have developed a method for efficiently cloning a human full-length cDNA that is predicted by
ATGpr and the like to be a full-length cDNA clone, from a full-length-enriched cDNA library synthesized by the
oligo-capping method. The inventors then determined the nucleotide sequences of the obtained cDNA clones from both
the 5'- and 3'-ends. Using these sequences, the inventors selected clones that were expected by PSORT (Nakai K.
and Kanehisa M. (1992) Genomics 14: 897-911) to contain a signal, and obtained clones that contain a cDNA encoding
a secretory protein or membrane protein. Moreover, the inventors specifically selected full-length cDNA clones that
encode secretory or membrane proteins, signal transduction-associated proteins, glycoprotein-associated proteins,
transcription-associated proteins, or disease-associated proteins from clones homologous to the entries in
Swiss-Prot (http://www.ebi.ac.uk/ebi_docsSwissProt_db/swisshome.html) according to the Swiss-Prot keywords.
[0011] The full-length cDNA clones of the present invention have a high fullness ratio since they were obtained by
the combination of (1) construction of a full-length-enriched cDNA library synthesized by the oligo-capping method,
and (2) a system in which the fullness ratio is evaluated from the nucleotide sequence of the 5'-end (in this
system, clones are selected based on the estimation by ATGpr, following the removal of sequences judged not to be
full-length when compared with ESTs). However, the primers of the present invention enable full-length cDNA to be
obtained easily without any of the special methods mentioned above.

[0012] Homology analysis of a non-full-length cDNA fragment is commonly performed to postulate the function of the
protein encoded by the fragment. However, since such analysis is based on the information of the fragment alone, it
is not clear whether the fragment corresponds to a functionally important part of the protein. In other words, the
reliability of homology analysis based on the information of a fragment is doubtful, as information relating to the
structure of the whole protein is not available. In contrast, the homology analysis of the present invention is
conducted based on the information of a full-length cDNA comprising the whole coding region of the cDNA, and
therefore the homology of various portions of the protein can be analyzed. Hence, the reliability of the homology
analysis has been dramatically improved in the present invention.
[0013] The inventors completed the invention by finding that it is possible to synthesize a novel full-length cDNA
by using the combination of a primer designed based on the nucleotide sequence of the 5'-ends of the selected
full-length cDNA clones and either an oligo-dT primer or a 3'-primer designed based on the nucleotide sequence of
the 3'-ends of the selected clones.

[0014] Thus, the present invention relates to primers described below, a method for synthesizing a polynucleotide
using the primers, and polynucleotides obtained by the method.
[0015] First, the present invention relates to

(1) use of an oligonucleotide as a primer for synthesizing the polynucleotide comprising the nucleotide sequence
set forth in any one of SEQ ID NOs: 1-829 and 2545, or the complementary strand thereof, wherein said
oligonucleotide is complementary to said polynucleotide or the complementary strand thereof and comprises at least
15 nucleotides;

(2) a primer set for synthesizing polynucleotides, the primer set comprising an oligo-dT primer and an
oligonucleotide complementary to the complementary strand of the polynucleotide comprising the nucleotide sequence
set forth in any one of SEQ ID NOs: 1-829 and 2545, wherein said oligonucleotide comprises at least 15 nucleotides;
and

(3) a primer set for synthesizing polynucleotides, the primer set comprising a combination of an oligonucleotide
comprising a nucleotide sequence complementary to the complementary strand of the polynucleotide comprising
a 5'-end nucleotide sequence and an oligonucleotide comprising a nucleotide sequence complementary to the
polynucleotide comprising a 3'-end nucleotide sequence, wherein said oligonucleotides comprise at least 15
nucleotides and wherein said combination of 5'-end nucleotide sequence / 3'-end nucleotide sequence is selected
from the combinations of 5'-end nucleotide sequence / 3'-end nucleotide sequence set forth in the SEQ ID NOs
in Table 1.

[0016] Table 1 shows the names of the clones obtained in the examples described later, which comprise the
polynucleotides of the present invention (830 clones), the names of the nucleotide sequences at the 5'-end and
3'-end of the full-length cDNA, and their corresponding SEQ ID NOs. A blank indicates that the 3'-end sequence
corresponding to the 5'-end sequence has not been determined in the same clone.

[0017] The SEQ ID NO of a 5'-end sequence is shown on the right side of the name of the 5'-end sequence, and the
SEQ ID NO of a 3'-end sequence is shown on the right side of the name of the 3'-end sequence.

Table 1

Correspondence between names of clone and the sequence name, and the SEQ ID.

Name of clone   Name of 5'-sequence   SEQ ID   Name of 3'-sequence   SEQ ID
                                                                     908
HEMBA1005489    F-HEMBA1005489        90       R-HEMBA1005489        909
HEMBA1005522    F-HEMBA1005522        91       R-HEMBA1005522        910
HEMBA1005545    F-HEMBA1005545        92       R-HEMBA1005545        911
HEMBA1005698    F-HEMBA1005698        93       R-HEMBA1005698        912
HEMBA1005913    F-HEMBA1005913        94       R-HEMBA1005913        913
HEMBA1005929    F-HEMBA1005929        95       R-HEMBA1005929        914
HEMBA1005945    F-HEMBA1005945        96       R-HEMBA1005945        915
HEMBA1006016    F-HEMBA1006016        97       R-HEMBA1006016        916
HEMBA1006171    F-HEMBA1006171        98       R-HEMBA1006171        917
HEMBA1006276    F-HEMBA1006276        99       R-HEMBA1006276        918
HEMBA1006299    F-HEMBA1006299        100      R-HEMBA1006299        919
HEMBA1006311    F-HEMBA1006311        101      R-HEMBA1006311        920
HEMBA1006335    F-HEMBA1006335        102      R-HEMBA1006335        921
HEMBA1006357    F-HEMBA1006357        103      R-HEMBA1006357        922
HEMBA1006430    F-HEMBA1006430        104      R-HEMBA1006430        923
HEMBA1006482    F-HEMBA1006482        105      R-HEMBA1006482        924
HEMBA1006517    F-HEMBA1006517        106      R-HEMBA1006517        925
HEMBA1006544    F-HEMBA1006544        107      R-HEMBA1006544        926
HEMBA1006572    F-HEMBA1006572        108      R-HEMBA1006572        927
HEMBA1006658    F-HEMBA1006658        109      R-HEMBA1006658        928
HEMBA1006707    F-HEMBA1006707        110      R-HEMBA1006707        929
HEMBA1006724    F-HEMBA1006724        111      R-HEMBA1006724        930
HEMBA1006749    F-HEMBA1006749        112      R-HEMBA1006749        931
HEMBA1006770    F-HEMBA1006770        113      R-HEMBA1006770        932
HEMBA1006902    F-HEMBA1006902        114      R-HEMBA1006902        933
HEMBA1006912    F-HEMBA1006912        115      R-HEMBA1006912        934
HEMBA1006916    F-HEMBA1006916        116      R-HEMBA1006916        935
HEMBA1006960    F-HEMBA1006960        117      R-HEMBA1006960        936
HEMBA1007013    F-HEMBA1007013        118      R-HEMBA1007013        937
HEMBA1007057    F-HEMBA1007057        119      R-HEMBA1007057        938
HEMBA1007063    F-HEMBA1007063        120      R-HEMBA1007063        939
HEMBA1007226    F-HEMBA1007226        121
HEMBA1007241    F-HEMBA1007241        122      R-HEMBA1007241        940
HEMBA1007291    F-HEMBA1007291        123      R-HEMBA1007291        941
HEMBA1007332    F-HEMBA1007332        124      R-HEMBA1007332        942
HEMBB1000106    F-HEMBB1000106        125      R-HEMBB1000106        943
HEMBB1000276    F-HEMBB1000276        126      R-HEMBB1000276        944
HEMBB1000309    F-HEMBB1000309        127      R-HEMBB1000309        945
HEMBB1000407    F-HEMBB1000407        128      R-HEMBB1000407        946
HEMBB1000447    F-HEMBB1000447        129      R-HEMBB1000447        947
HEMBB1000542    F-HEMBB1000542        130      R-HEMBB1000542        948
HEMBB1000567    F-HEMBB1000567        131      R-HEMBB1000567        949
HEMBB1000642    F-HEMBB1000642        132      R-HEMBB1000642        950
HEMBB1000668    F-HEMBB1000668        133      R-HEMBB1000668        951
HEMBB1000679    F-HEMBB1000679        134      R-HEMBB1000679        952
HEMBB1000881    F-HEMBB1000881        135      R-HEMBB1000881        953
HEMBB1000905    F-HEMBB1000905        136      R-HEMBB1000905        954
HEMBB1001026    F-HEMBB1001026        137      R-HEMBB1001026        955
HEMBB1001048    F-HEMBB1001048        138      R-HEMBB1001048        956
HEMBB1001200    F-HEMBB1001200        139      R-HEMBB1001200        957
HEMBB1001407    F-HEMBB1001407        140      R-HEMBB1001407        958
HEMBB1001530    F-HEMBB1001530        141      R-HEMBB1001530        959
HEMBB1001547    F-HEMBB1001547        142      R-HEMBB1001547        960
HEMBB1001573    F-HEMBB1001573        143      R-HEMBB1001573        961

Table 1 (continued)

Name of clone   Name of 5'-sequence   SEQ ID   Name of 3'-sequence   SEQ ID
Y79AA1001795    F-Y79AA1001795        816      R-Y79AA1001795        1560
Y79AA1001799    F-Y79AA1001799        817      R-Y79AA1001799        1561
Y79AA1001803    F-Y79AA1001803        818      R-Y79AA1001803        1562
Y79AA1001863    F-Y79AA1001863        819      R-Y79AA1001863        1563
Y79AA1002022    F-Y79AA1002022        820      R-Y79AA1002022        1564
Y79AA1002058    F-Y79AA1002058        821
Y79AA1002121    F-Y79AA1002121        822      R-nnnnnnnnnnnn        1565
Y79AA1002129    F-Y79AA1002129        823      R-nnnnnnnnnnnn        1566
Y79AA1002213    F-Y79AA1002213        824      R-Y79AA1002213        1567
Y79AA1002334    F-Y79AA1002334        825      R-Y79AA1002334        1568
Y79AA1002373    F-Y79AA1002373        826      R-Y79AA1002373        1569
Y79AA1002376    F-Y79AA1002376        827      R-Y79AA1002376        1570
Y79AA1002378    F-Y79AA1002378        828      R-Y79AA1002378        1571
Y79AA1002381    F-Y79AA1002381        829      R-Y79AA1002381        1572
NT2RP2006580    F-NT2RP2006580        2545     R-NT2RP2006580        2546



A sequence name starting with "F" is the name of a 5'-end sequence, and a sequence name starting with "R" is the
name of a 3'-end sequence. A blank indicates that the 3'-end sequence corresponding to the 5'-end sequence has not
been determined in the clone.
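
The pairing recorded in Table 1 can be treated as a lookup from clone name to the SEQ ID NOs of its 5'- and 3'-end sequences. The following is a minimal sketch in Python, using three rows copied from Table 1. The names `EndPair`, `TABLE1_ROWS`, and `reverse_primer_source` are illustrative assumptions and are not part of the disclosure. A blank 3'-end entry is represented as `None`; in that case the sketch falls back to an oligo-dT primer, on the assumption that primer set (2) would be used for such a clone.

```python
from typing import NamedTuple, Optional


class EndPair(NamedTuple):
    """SEQ ID NOs of the 5'-end and 3'-end sequences of one clone."""
    five_prime_seq_id: int
    three_prime_seq_id: Optional[int]  # None when Table 1 shows a blank


# Three rows copied from Table 1 (illustrative subset only).
TABLE1_ROWS = {
    "HEMBA1007063": EndPair(120, 939),
    "HEMBA1007226": EndPair(121, None),
    "NT2RP2006580": EndPair(2545, 2546),
}


def reverse_primer_source(clone_name: str) -> str:
    """Return where the 3'-primer would come from for the given clone.

    Assumption: a clone without a determined 3'-end sequence is amplified
    with an oligo-dT primer instead of a sequence-specific 3'-primer.
    """
    pair = TABLE1_ROWS[clone_name]
    if pair.three_prime_seq_id is None:
        return "oligo-dT"
    return f"SEQ ID NO: {pair.three_prime_seq_id}"
```

For example, `reverse_primer_source("HEMBA1007226")` reports that only the oligo-dT option is available for that clone, because its 3'-end column is blank.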

[0018] Furthermore, the present invention relates to the use of the above primers, as described below. 

(4) A polynucleotide which can be synthesized with the primer set of (2) or (3). 

(5) A polynucleotide comprising a coding region in the polynucleotide of (4). 
(6) A substantially pure protein encoded by the polynucleotide of (4).

(7) A partial peptide of the protein of (6). 

[0019] In addition, the present invention comprises a polynucleotide described below and a protein encoded by the 
polynucleotide. 

(8) An isolated polynucleotide selected from the group consisting of

(a) a polynucleotide comprising a coding region of the nucleotide sequence set forth in any one of the SEQ
ID NOs in Table 370;

(b) a polynucleotide comprising a nucleotide sequence encoding a protein comprising the amino acid sequence
set forth in any one of the SEQ ID NOs in Table 370;

(c) a polynucleotide comprising a nucleotide sequence encoding a protein comprising an amino acid sequence
selected from the amino acid sequences set forth in the SEQ ID NOs in Table 370, in which one or more amino
acids are substituted, deleted, inserted, and/or added, wherein said protein is functionally equivalent to the
protein comprising said amino acid sequence selected from the amino acid sequences set forth in the SEQ
ID NOs in Table 370;

(d) a polynucleotide that hybridizes with a polynucleotide comprising a nucleotide sequence selected from the
nucleotide sequences set forth in the SEQ ID NOs in Table 370, and that comprises a nucleotide sequence
encoding a protein functionally equivalent to the protein encoded by the nucleotide sequence selected from
the nucleotide sequences set forth in the SEQ ID NOs in Table 370;

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a protein
encoded by the polynucleotide of (a) to (d);

(f) a polynucleotide comprising a nucleotide sequence with at least 70% identity to the nucleotide sequence
set forth in any one of the SEQ ID NOs in Table 370.

(9) A substantially pure protein encoded by the polynucleotide of (8).

(10) An antibody against the protein or peptide of any one of (6), (7), and (9).

(11) A vector comprising the polynucleotide of (5) or (8).

(12) A transformant carrying the polynucleotide of (5) or (8), or the vector of (11).

(13) A transformant expressibly carrying the polynucleotide of (5) or (8), or the vector of (11).

(14) A method for producing the protein or peptide of any one of (6), (7), and (9), comprising culturing the
transformant of (13) and recovering the expression product.

(15) An oligonucleotide comprising the nucleotide sequence set forth in any one of the SEQ ID NOs in Table 370
or the nucleotide sequence complementary to the complementary strand thereof, wherein said oligonucleotide
comprises 15 nucleotides or more.

(16) Use of the oligonucleotide of (15) as a primer for synthesizing a polynucleotide.

(17) Use of the oligonucleotide of (15) as a probe for detecting a gene.

(18) An antisense polynucleotide against the polynucleotide of (8), or a portion thereof.

(19) A method for synthesizing a polynucleotide, the method comprising:

a) synthesizing a complementary strand using a cDNA library as a template, and using the primer set of (2)
or (3), or the primer of (16); and
b) recovering the synthesized product.

(20) The method of (19), wherein the cDNA library is obtainable by the oligo-capping method.

(21) The method of (19), wherein the complementary strand is obtainable by PCR.

(22) A method for detecting the polynucleotide of (8), the method comprising:

a) incubating a target polynucleotide with the oligonucleotide of (15) under conditions where hybridization
occurs, and

b) detecting the hybridization of the target polynucleotide with the oligonucleotide of (15).

(23) A database of polynucleotides and/or proteins, the database comprising information on at least one sequence
selected from the nucleotide sequences set forth in the SEQ ID NOs in Table 370 and/or the amino acid sequences
set forth in the SEQ ID NOs in Table 370, or a medium on which the database is stored.
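
Item (8)(f) above refers to "at least 70% identity" between nucleotide sequences. As a rough illustration only, the following Python sketch computes percent identity for two sequences that have already been aligned to equal length, with gaps written as `-`. The function name `percent_identity` and the choice of denominator (all aligned columns except those where both sequences have a gap) are assumptions made for this sketch; alignment programs may define identity differently, and this passage does not specify which definition applies.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Gap characters are '-'. Columns where both sequences have a gap are
    ignored; every other column counts toward the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # column carries no information
        compared += 1
        if x == y:
            matches += 1

    if compared == 0:
        raise ValueError("no comparable columns in the alignment")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 70.0) -> bool:
    """True when the pair reaches the threshold (70% as in item (8)(f))."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

With one mismatch in ten aligned positions, the function returns 90.0, which is above the 70% threshold.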

[0020] Any patents, patent applications, and publications cited herein are incorporated by reference. 


BRIEF DESCRIPTION OF THE DRAWINGS
[0021]

Figure 1 shows the restriction maps of vectors pME18SFL3 and pUC19FL3.

Figure 2 shows the reproducibility of gene expression analysis. The ordinate and the abscissa show the intensities
of gene expression obtained in experiments different from each other.

Figure 3 shows the detection limit in gene expression analysis. The intensity of expression is shown on the
ordinate, and the concentration (ng/ml) of the probe used is shown on the abscissa.

DETAILED DESCRIPTION OF THE INVENTION 

[0022] Herein, "polynucleotide" is defined as a molecule in which multiple nucleotides are polymerized. There is
no limitation on the number of polymerized nucleotides. When the polymer contains a relatively small number of
nucleotides, it is also described as an "oligonucleotide". The polynucleotide or the oligonucleotide of the present
invention can be a natural or chemically synthesized product. Alternatively, it can be synthesized using a template
DNA by an enzymatic reaction such as PCR.

[0023] All the cDNAs provided by the invention are full-length cDNAs. Herein, a "full-length cDNA" is defined as a
cDNA which contains both the ATG codon (the translation start site) and the stop codon. Accordingly, the
untranslated regions, which are originally found upstream or downstream of the protein coding region in natural
mRNA, may or may not be contained.
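
The definition of a full-length cDNA given above (an ATG codon and a stop codon both present) can be checked mechanically. The following Python sketch looks for the first ATG that is followed by an in-frame stop codon and returns the coordinates of that open reading frame. It is only an illustration of the definition: the function name `first_orf` is an assumption, and the sketch is not the ATGpr program and does not predict whether a given ATG is the true translation start site.

```python
from typing import Optional, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_orf(cdna: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the first ATG...stop open reading frame.

    Coordinates are 0-based, with `end` exclusive and including the stop
    codon. Returns None when no ATG with an in-frame stop codon exists.
    """
    seq = cdna.upper()
    for start in range(len(seq) - 2):
        if seq[start:start + 3] != "ATG":
            continue
        # Walk codon by codon in the frame set by this ATG.
        for pos in range(start + 3, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOP_CODONS:
                return (start, pos + 3)
    return None


def contains_start_and_stop(cdna: str) -> bool:
    """True when the sequence satisfies the ATG-plus-stop-codon definition."""
    return first_orf(cdna) is not None
```

A sequence such as `GGCATGAAATTTTAGCC` yields the frame `ATGAAATTTTAG`, whereas `ATGAAACCC` yields `None` because no in-frame stop codon follows the ATG.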

An "isolated polynucleotide" is a polynucleotide the structure of which is not identical to that of any naturally
occurring nucleic acid or to that of any fragment of a naturally occurring genomic nucleic acid spanning more than
three separate genes. The term therefore covers, for example,

(a) a DNA which has the sequence of part of a naturally occurring genomic DNA molecule but is not flanked by
both of the coding sequences that flank that part of the molecule in the genome of the organism in which it naturally
occurs;

(b) a nucleic acid incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner
such that the resulting molecule is not identical to any naturally occurring vector or genomic DNA;

(c) a separate molecule such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction
(PCR), or a restriction fragment; and

(d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein.

Specifically excluded from this definition are nucleic acids present in mixtures of different (i) DNA molecules,
(ii) transfected cells, or (iii) cell clones, e.g., as these occur in a DNA library such as a cDNA or genomic DNA
library.

[0024] The term "substantially pure" as used herein in reference to a given polypeptide means that the protein or
polypeptide is substantially free from other biological macromolecules. The substantially pure protein or
polypeptide is at least 75% (e.g., at least 80, 85, 95, or 99%) pure by dry weight. Purity can be measured by any
appropriate standard method, for example, by column chromatography, polyacrylamide gel electrophoresis, or HPLC
analysis.
[0025] All the clones of the present invention (830 clones) are novel and full-length, and are also predicted to
encode any of the following functional proteins:

secretory proteins,
membrane proteins,
proteins associated with signal transduction (signal transduction-associated proteins; e.g., protein kinases),
proteins associated with a glycoprotein (glycoprotein-associated proteins),
proteins associated with transcription (transcription-associated proteins),
proteins associated with diseases (disease-associated proteins),
or enzymes and/or metabolism-associated proteins, cell division- and/or cell proliferation-associated proteins,
cytoskeleton-associated proteins, nuclear proteins, DNA- and/or RNA-binding proteins, ATP- and/or GTP-binding
proteins, protein synthesis- and/or protein transport-associated proteins, and cellular defense-associated proteins.

[0026] Furthermore, all the cDNA clones of the present invention can be characterized as follows:

(1) a cDNA that is obtained by the oligo-capping method, which provides cDNA with a high fullness ratio. The cDNA
was selected by its score in ATGpr (also described as ATGpr1), which is a program for predicting the fullness of
the 5'-end of cDNA based on the features of the 5'-end sequence. In addition, PSORT, which is a program for
predicting the existence of a signal sequence, was used to select cDNA that contains a signal sequence at the
5'-end or a transmembrane region in the protein coding region. Furthermore, a homology search with the 5'-end
sequences confirmed that the selected clones were not identical to any known human mRNA (namely, they are novel);

or,

(2) a cDNA that is obtained by the oligo-capping method, which provides cDNA with a high fullness ratio. The cDNA
was selected by its score in ATGpr, which is a program for predicting the fullness of the 5'-end based on the
features of the 5'-end sequence. Furthermore, a cDNA that has relative homology with the amino acid sequence of a
protein with known functions was selected by a BLAST search (Altschul S.F., Gish W., Miller W., Myers E.W., and
Lipman D.J. (1990) J. Mol. Biol. 215: 403-410; Gish W., and States D.J. (1993) Nature Genet. 3: 266-272) of the
SwissProt database using the 5'-end sequence. In addition, a homology search using the 5'-end sequence confirmed
that the selected clones were not identical to any known human mRNA (namely, they are novel).

[0027] All clones are obtainable as full-length clones by a method such as PCR (Current Protocols in Molecular
Biology, Ausubel et al. edit, (1987) John Wiley & Sons, Section 6.1-6.4) using both the 5'- and 3'-end sequences, or
using the 5'-end sequence and an oligo-dT primer that corresponds to the polyA sequence.
[0028] Specifically, PCR can be performed using, as the 5'-primer, an oligonucleotide that is 15 nucleotides or
longer and hybridizes specifically with the complementary strand of the polynucleotide containing a nucleotide
sequence selected from the 5'-end sequences shown in Table 1 (SEQ ID NOs: 1-829 and SEQ ID NO: 2545), and, as the
3'-primer, an oligo-dT primer. The length of the primers is usually 15-100 bp, and preferably 15-35 bp. In the case
of LA PCR, which is described below, a primer length of 25-35 bp may give good results.
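
The primer geometry described above can be sketched in Python as follows. The 5'-primer is taken directly from the sense-strand 5'-end sequence, and a sequence-specific 3'-primer is the reverse complement of the 3'-end sequence. This is a minimal sketch under stated assumptions: both end sequences are assumed to be written 5' to 3' on the sense strand, the default length of 20 nt is an arbitrary value within the 15-35 range given above, and the function names are hypothetical. Specificity toward the target cDNA, melting temperature, and the position of the translation start site are not checked here.

```python
# Complement table for unambiguous DNA bases only.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

MIN_PRIMER_LENGTH = 15   # lower bound stated in the text
MAX_PRIMER_LENGTH = 100  # upper bound stated in the text


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def _check_length(seq: str, length: int) -> None:
    if not MIN_PRIMER_LENGTH <= length <= MAX_PRIMER_LENGTH:
        raise ValueError("primer length must be between 15 and 100 nt")
    if len(seq) < length:
        raise ValueError("end sequence is shorter than the requested primer")


def forward_primer(five_prime_end: str, length: int = 20) -> str:
    """5'-primer: the first `length` nt of the sense-strand 5'-end sequence."""
    _check_length(five_prime_end, length)
    return five_prime_end[:length].upper()


def reverse_primer(three_prime_end: str, length: int = 20) -> str:
    """3'-primer: reverse complement of the last `length` nt of the
    sense-strand 3'-end sequence (assumed orientation)."""
    _check_length(three_prime_end, length)
    return reverse_complement(three_prime_end[-length:])
```

Taking the primer from the very start of the 5'-end sequence keeps it upstream of any ATG within that sequence, so the amplification product would include the translation start site whenever the start site lies inside the determined 5'-end sequence.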
[0029] A method to design a primer that enables specific amplification based on a given nucleotide sequence is
known to those skilled in the art (Current Protocols in Molecular Biology, Ausubel et al. edit, (1987) John Wiley &
Sons, Section 6.1-6.4). In designing a primer based on the 5'-end sequence, the primer is designed so that, in
principle, the amplification products will include the translation start site. Accordingly, when a given 5'-end
nucleotide sequence is the 5'-untranslated region (5'UTR), any part of the sequence can be used as a 5'-primer as
long as specificity toward the target cDNA is ensured. The translation start site can be predicted using a known
method such as ATGpr, as described below.

[0030] When synthesizing a polynucleotide, the target nucleotide sequence to be amplified can extend to several
thousand bp in some cDNAs. However, it is possible to amplify such a long nucleotide sequence by using a method
such as LA PCR (Long and Accurate PCR). It is advantageous to use LA PCR when synthesizing long DNA. In LA PCR, in
which a special DNA polymerase having 3'→5' exonuclease activity is used, misincorporated nucleotides can be
removed. Accordingly, accurate synthesis of the complementary strand can be achieved even with a long nucleotide
sequence. It is reported that, by using LA PCR, amplification of a nucleotide sequence of 20 kb or longer can be
achieved under desirable conditions (Takeshi Hayashi (1996) Jikken-Igaku Bessatsu, "Advanced Technologies in PCR"
Youdo-sha).
[0031] A template DNA for synthesizing the cDNA of the present invention can be obtained by using cDNA libraries
that are prepared by various methods. The full-length cDNA clones obtained here are those with a high fullness
ratio, which were obtained using a combination of (1) a method to prepare a full-length-enriched cDNA library using
the oligo-capping method, and (2) an estimation system for fullness using the 5'-end sequence (selection based on
the estimation by ATGpr after removing clones that are non-full-length compared to the ESTs). However, it is
possible to easily obtain a full-length cDNA by using the primers provided by the present invention, without
relying on the specialized method described above.

[0032] The problem with cDNA libraries that are prepared by known methods or are commercially available is that the
mRNA contained in the libraries has a very low fullness ratio. Thus, it is difficult to screen full-length cDNA
clones directly from such a library using ordinary cloning methods. The present invention has revealed primers that
are capable of synthesizing a full-length cDNA. Once the primers are provided, it is possible to synthesize a target
full-length cDNA by using enzymatic reactions such as PCR. In particular, a full-length-enriched cDNA library,
synthesized by methods such as oligo-capping, is desirable for synthesizing a full-length cDNA more reliably.

[0033] Once the nucleotide sequences of the full-length cDNAs obtained in the present invention are determined, it
is possible to predict the functions of the proteins encoded by the cDNA clones, for example, by searching databases
such as GenBank (http://www.ncbi.nlm.nih.gov/web/GenBank/), Swiss-Prot
(http://www.ebi.ac.uk/ebi_docsSwissProt_db/swisshome.html), and UniGene (http://www.ncbi.nlm.nih.gov/UniGene) for
homologies of the cDNAs, or by searching the amino acid sequences deduced from the full-length nucleotide sequences
for a signal sequence by using software such as PSORT (K. Nakai & M. Kanehisa, Genomics, 14: 897-911 (1992)), for a
transmembrane region by using software such as SOSUI (T. Hirokawa et al., Bioinformatics, 14: 378-379 (1998); Mitsui
Knowledge Industry Co., Ltd.), or for a motif by using software such as Pfam
(http://www.sanger.ac.uk/Software/Pfam/index.shtml) or PROSITE (http://www.expasy.ch/prosite). As a matter of
course, the functions are often predictable by using partial sequence information (preferably 300 nucleotides or
more) instead of the full-length nucleotide sequences. However, the result of a prediction obtained by using partial
sequence information does not always agree with the result obtained by using the full-length nucleotide sequence,
and thus the prediction of function is preferably performed based on the full-length nucleotide sequences.

[0034] Homology searches against GenBank, Swiss-Prot and UniGene were performed for the 826 clones whose
full-length nucleotide sequences had been determined (HEMBA1005337, NT2RM1000407, NT2RM2001767, and
NT2RP3003939 are not full-length). The amino acid sequences deduced from the full-length nucleotide sequences
were searched for functional domains by using the analytical software programs PSORT, SOSUI and Pfam. Based on the
results, the proteins encoded by the cDNA clones were grouped into several categories and their functions were predicted.
[0035] The following 437 clones were categorized into secretory and/or membrane proteins. The clones categorized
into secretory and/or membrane proteins are those which matched the full-length sequences of the Swiss-Prot database
with the keywords "growth factor", "cytokine", "hormone", "signal", "transmembrane", "membrane", "extracellular matrix",
"receptor", "G-protein coupled receptor", "ionic channel", "voltage-gated channel", "calcium channel", "cell adhesion",
"collagen" or "connective tissue"; those which matched data suggesting that the proteins are secretory and/or
membrane proteins; or those which matched the full-length sequences of the GenBank or UniGene database with similar
description; and, further, those predicted to have an N-terminal signal sequence or a transmembrane region as a result
of domain search for the amino acid sequences deduced from the full-length nucleotide sequences.
BNGH41000020, BNGH41000087, BNGH41000091, HEMBA1000121, HEMBA1000128, HEMBA1000349,
HEMBA1000477, HEMBA1000590, HEMBA1000713, HEMBA1000732, HEMBA1000745, HEMBA1000835,
HEMBA1000940, HEMBA1000962, HEMBA1001221, HEMBA1001228, HEMBA1001621, HEMBA1002131,
HEMBA1002163, HEMBA1002167, HEMBA1002178, HEMBA1002195, HEMBA1002227, HEMBA1002420,
HEMBA1002421, HEMBA1002767, HEMBA1003047, HEMBA1003101, HEMBA1003230, HEMBA1003392,
HEMBA1003530, HEMBA1003602, HEMBA1003732, HEMBA1003945, HEMBA1004110, HEMBA1004250,
HEMBA1004391, HEMBA1004444, HEMBA1004454, HEMBA1004505, HEMBA1004797, HEMBA1004982,
HEMBA1005070, HEMBA1005449, HEMBA1005522, HEMBA1005545, HEMBA1005698, HEMBA1005945,
HEMBA1006171, HEMBA1006299, HEMBA1006311, HEMBA1006335, HEMBA1006357, HEMBA1006430,
HEMBA1006482, HEMBA1006707, HEMBA1006724, HEMBA1006749, HEMBA1006902, HEMBA1006960,
HEMBA1007241, HEMBB1000407, HEMBB1000447, HEMBB1000567, HEMBB1000679, HEMBB1000881,



HEMBB1001026, HEMBB1001048, HEMBB1001407, HEMBB1001530, HEMBB1001573, HEMBB1001847,
HEMBB1001978, HEMBB1002041, HEMBB1002162, HEMBB1002245, HEMBB1002427, HEMBB1002693,
MAMMA1000102, MAMMA1000106, MAMMA1000118, MAMMA1000141, MAMMA1000204, MAMMA1000226,
MAMMA1000457, MAMMA1000473, MAMMA1000496, MAMMA1000591, MAMMA1000681, MAMMA1000810,
MAMMA1000986, MAMMA1000994, MAMMA1001043, MAMMA1001141, MAMMA1001237, MAMMA1001344,
MAMMA1001418, MAMMA1001893, MAMMA1001957, MAMMA1001978,
MAMMA1002070, MAMMA1002091, MAMMA1002095, MAMMA1002165, MAMMA1002234, MAMMA1002586,
MAMMA1002633, MAMMA1003126, NT2RM1000462, NT2RM1000542, NT2RM1000580, NT2RM1000855,
NT2RM1000858, NT2RM1000899, NT2RM2000241, NT2RM2000410, NT2RM2000423, NT2RM2000565,
NT2RM2001626, NT2RM2001792, NT2RM2001939, NT2RM2001941, NT2RM4000198, NT2RM4000284,
NT2RM4000417, NT2RM4000444, NT2RM4000587, NT2RM4000593, NT2RM4000648, NT2RM4000761,
NT2RM4000997, NT2RM4001325, NT2RM4001735, NT2RM4001768, NT2RM4001843, NT2RM4002352,
NT2RP1000050, NT2RP1000181, NT2RP1000261, NT2RP1000300, NT2RP1000325, NT2RP1000448,
NT2RP1000551, NT2RP1000613, NT2RP1000981, NT2RP1001004, NT2RP1001563, NT2RP2000479,
NT2RP2000533, NT2RP2000616, NT2RP2000649, NT2RP2000663, NT2RP2000694, NT2RP2000818,
NT2RP2000903, NT2RP2001200, NT2RP2001276, NT2RP2001480, NT2RP2001495, NT2RP2001514,
NT2RP2001755, NT2RP2001915, NT2RP2001956, NT2RP2002063, NT2RP2002188, NT2RP2002232,
NT2RP2002527, NT2RP2002533, NT2RP2002721, NT2RP2002824, NT2RP2002942, NT2RP2002976,
NT2RP2003042, NT2RP2003210, NT2RP2003383, NT2RP2003390, NT2RP2003469, NT2RP2003593,
NT2RP2003655, NT2RP2003664, NT2RP2003950, NT2RP2004179, NT2RP2004205, NT2RP2004495,
NT2RP2004524, NT2RP2004556, NT2RP2004606, NT2RP2004648, NT2RP2004794, NT2RP2005027,
NT2RP2005163, NT2RP2005181, NT2RP2005378, NT2RP2005463, NT2RP2005541, NT2RP2005597,
NT2RP2005666, NT2RP2005883, NT2RP2005994, NT2RP2006004,
NT2RP2006042, NT2RP2006269, NT2RP1006512, NT2RP2006580, NT2RP3000169, NT2RP3000171,
NT2RP3000304, NT2RP3000436, NT2RP3000460, NT2RP3000616, NT2RP3000676, NT2RP3000721,
NT2RP3000818, NT2RP3000907, NT2RP3000921, NT2RP3001012, NT2RP3001159, NT2RP3001195,
NT2RP3001240, NT2RP3001271, NT2RP3001322, NT2RP3001388, NT2RP3001560, NT2RP3001592,
NT2RP3001650, NT2RP3001738, NT2RP3001858, NT2RP3002015, NT2RP3002160, NT2RP3002311,
NT2RP3002342, NT2RP3002411, NT2RP3002737, NT2RP3002790, NT2RP3002836, NT2RP3002900,
NT2RP3002958, NT2RP3003000, NT2RP3003076, NT2RP3003354, NT2RP3003532, NT2RP3003535,
NT2RP3003614, NT2RP3004025, NT2RP3004075, NT2RP3004083, NT2RP3004130, NT2RP3004133,
NT2RP3004309, NT2RP3004345, NT2RP3004406, NT2RP3004481, NT2RP3004552, NT2RP3004625,
NT2RP3004647, NT2RP4001001, NT2RP4001009, NT2RP4001467, NT2RP4001879, NT2RP4002187,
NT2RP4002451, NT2RP4002750, OVARC1000003, OVARC1000105, OVARC1000298, OVARC1000307,
OVARC1000313, OVARC1000410, OVARC1000439, OVARC1000553, OVARC1000811, OVARC1000873,
OVARC1000956, OVARC1001030, OVARC1001163, OVARC1001336, OVARC1001570, OVARC1001607,
OVARC1001725, OVARC1001991, PLACE1000033, PLACE1000231, PLACE1000560, PLACE1000740,
PLACE1000912, PLACE1000914, PLACE1000927, PLACE1001016, PLACE1001123, PLACE1001183,
PLACE1001231, PLACE1001340, PLACE1001401, PLACE1001407, PLACE1001464, PLACE1001516,
PLACE1001536, PLACE1001564, PLACE1001655, PLACE1001795,
PLACE1001836, PLACE1001918, PLACE1001949, PLACE1002080, PLACE1002095, PLACE1002355,
PLACE1002374, PLACE1002518, PLACE1002547, PLACE1002726, PLACE1002905, PLACE1002911,
PLACE1002967, PLACE1003407, PLACE1003573, PLACE1003737, PLACE1003772, PLACE1003839,
PLACE1003845, PLACE1003852, PLACE1004279, PLACE1004282, PLACE1004441, PLACE1004450,
PLACE1004482, PLACE1004520, PLACE1004630, PLACE1004657, PLACE1004648, PLACE1004816,
PLACE1005003, PLACE1005005, PLACE1005031, PLACE1005383, PLACE1005410, PLACE1005426,
PLACE1005544, PLACE1005569, PLACE1005660, PLACE1005725, PLACE1005745, PLACE1005878,
PLACE1005927, PLACE1006071, PLACE1006093, PLACE1006208, PLACE1006277, PLACE1006290,
PLACE1006443, PLACE1006716, PLACE1006809, PLACE1006959, PLACE1007081, PLACE1007096,
PLACE1007296, PLACE1007626, PLACE1007845, PLACE1007881, PLACE1008359, PLACE1008469,
PLACE1008716, PLACE1008744, PLACE1008985, PLACE1009067, PLACE1009196, PLACE1009279,
PLACE1009527, PLACE1009546, PLACE1009600, PLACE1009982, PLACE1010011, PLACE1010078,
PLACE1010251, PLACE1010445, PLACE1010713, PLACE1010784, PLACE1010827, PLACE1010968,
PLACE1011116, PLACE1011181, PLACE1011236, PLACE1011516, PLACE1011708, PLACE3000181,
PLACE3000213, PLACE4000354, SKNMC1000004, SKNMC1000014, SKNMC1000082, THYRO1000036,
THYRO1000099, THYRO1000196, THYRO1000400, THYRO1000584, THYRO1000678, THYRO1000776,
THYRO1000795, THYRO1000956, THYRO1001102, THYRO1001113,
THYRO1001205, THYRO1001237, THYRO1001242, THYRO1001266, THYRO1001327, THYRO1001456,
THYRO1001478, THYRO1001523, THYRO1001529, THYRO1001641, THYRO1001702, THYRO1001725,
Y79AA1000207, Y79AA1000226, Y79AA1000270, Y79AA1000426, Y79AA1000521, Y79AA1000876,
Y79AA1000888, Y79AA1000959, Y79AA1001013, Y79AA1001212, Y79AA1001264, Y79AA1001328,
Y79AA1001426, Y79AA1001427, Y79AA1001430, Y79AA1001727, Y79AA1001787, Y79AA1001795,
Y79AA1001799, Y79AA1001803, Y79AA1002022, Y79AA1002058, Y79AA1002129, Y79AA1002213,
Y79AA1002373

[0036] The following 146 clones were categorized into glycoprotein-associated proteins. The clones categorized into
glycoprotein-associated proteins are those which matched the full-length sequences of the Swiss-Prot database with the
keyword "glycoprotein"; those which matched data suggesting that the proteins are glycoproteins; or those which
matched the full-length sequences of the GenBank or UniGene database with similar description.
BNGH41000087, BNGH41000091, HEMBA1000349, HEMBA1000590, HEMBA1000745, HEMBA1000835,
HEMBA1001221, HEMBA1001228, HEMBA1001621, HEMBA1002131, HEMBA1002178, HEMBA1002421,
HEMBA1002767, HEMBA1003230, HEMBA1003392, HEMBA1004250, HEMBA1004391, HEMBA1004444,
HEMBA1004505, HEMBA1005449, HEMBA1005522, HEMBA1005545, HEMBA1006707, HEMBA1006749,
HEMBA1006902, HEMBB1000679, HEMBB1000881, HEMBB1001048, HEMBB1002120, HEMBB1002245,
HEMBB1002427, MAMMA1000102, MAMMA1000591, MAMMA1000681, MAMMA1001043, MAMMA1001237,
MAMMA1002070, MAMMA1002586, MAMMA1003126, NT2RM1000462, NT2RM1000580, NT2RM2001792,
NT2RM2001818, NT2RM2001939, NT2RM2001941, NT2RM4000198, NT2RM4000284, NT2RM4000417,
NT2RM4000648, NT2RM4000997, NT2RM4001325, NT2RM4002352, NT2RP1000613, NT2RP1000981,
NT2RP1001004, NT2RP2000616, NT2RP2000694, NT2RP2000903, NT2RP2001480, NT2RP2001755,
NT2RP2002533, NT2RP2003042, NT2RP2003210, NT2RP2004205, NT2RP2004606, NT2RP2005027,
NT2RP2005181, NT2RP2005541, NT2RP2005597, NT2RP2005883, NT2RP2006004, NT2RP2006042,
NT2RP2006269, NT2RP3000304, NT2RP3000616, NT2RP3000921, NT2RP3001650, NT2RP3002160,
NT2RP3002737, NT2RP3002958, NT2RP3003000, NT2RP3003532, NT2RP3004130, NT2RP3004133,
NT2RP3004481, NT2RP3004552, NT2RP3004640, NT2RP4000108, NT2RP4001467, NT2RP4002750,
OVARC1000003, OVARC1000553, OVARC1000811, OVARC1000873, OVARC1001336, OVARC1001607,
OVARC1001991, PLACE1000033, PLACE1000740, PLACE1001016,
PLACE1001123, PLACE1001231, PLACE1001464, PLACE1001655, PLACE1001836, PLACE1002355,
PLACE1002374, PLACE1002905, PLACE1002911, PLACE1003573, PLACE1003737, PLACE1003772,
PLACE1003839, PLACE1004282, PLACE1004441, PLACE1004450, PLACE1004520, PLACE1004648,
PLACE1005003, PLACE1005426, PLACE1006071, PLACE1006073, PLACE1006290, PLACE1007081,
PLACE1007845, PLACE1008716, PLACE1008744, PLACE1008985, PLACE1010251, PLACE1010784,
PLACE1010968, PLACE1011116, PLACE3000181, PLACE3000213, PLACE4000354, THYRO1000036,
THYRO1000196, THYRO1000584, THYRO1000956, THYRO1001266, Y79AA1000270, Y79AA1000426,
Y79AA1001727, Y79AA1001795, Y79AA1002022, Y79AA1002213

[0037] The following 57 clones were categorized into signal transduction-associated proteins. The clones categorized
into signal transduction-associated proteins are those which matched the full-length sequences of the Swiss-Prot database
with the keywords "serine/threonine-protein kinase", "tyrosine-protein kinase" or "SH3 domain"; those which matched
data suggesting that the proteins are signal transduction-associated proteins (for example, "ADP-ribosylation factor");
or those which matched the full-length sequences of the GenBank or UniGene database with similar description;
and, further, those which were similarly predicted to be signal transduction-associated proteins based on the matching
data of Pfam.



HEMBA1000006, HEMBA1002195, HEMBA1002227, HEMBA1002551, HEMBA1005084, HEMBA1005929,
HEMBA1006658, HEMBA1006916, MAMMA1000881, MAMMA1001150, MAMMA1001310, MAMMA1002142,
NT2RM2001902, NT2RP1001020, NT2RP1001031, NT2RP2001469, NT2RP2001529, NT2RP2001769,
NT2RP2003179, NT2RP2003545, NT2RP2004670, NT2RP3000011, NT2RP3000022, NT2RP3000172,
NT2RP3000201, NT2RP3000820, NT2RP3003527, NT2RP3003849, NT2RP3003874, NT2RP3004067,
NT2RP4000634, NT2RP4000962, OVARC1000255, OVARC1000529, OVARC1000916, OVARC1001338,
OVARC1001569, PLACE1002329, PLACE1003135, PLACE1003598, PLACE1005519, PLACE1006208,
PLACE1008282, PLACE1008297, PLACE1010081, PLACE1011364, PLACE1011824, THYRO1001457,
THYRO1001593, THYRO1001700, THYRO1001770, Y79AA1000777, Y79AA1000967, Y79AA1002376,
Y79AA1002381, HEMBB1000668, NT2RM4001377

[0038] The following 81 clones were categorized into transcription-associated proteins. The clones categorized into
transcription-associated proteins are those which matched the full-length sequences of the Swiss-Prot database with the
keywords "transcription regulation", "zinc finger" or "homeobox"; those which matched data suggesting that the
proteins are transcription-associated proteins; or those which matched the full-length sequences of the GenBank or
UniGene database with similar description; and, further, those which were similarly predicted to be transcription-associated
proteins based on the matching data of Pfam.
HEMBA1000462, HEMBA1000671, HEMBA1001297, HEMBA1001390, HEMBA1001886, HEMBA1002048,
HEMBA1003120, HEMBA1003497, HEMBA1004785, HEMBA1005230, HEMBA1005246, HEMBA1006276,
HEMBA1006572, HEMBA1007226, HEMBB1000106, HEMBB1000905, HEMBB1001959, HEMBB1002051,
HEMBB1002661, MAMMA1001094, MAMMA1001532, MAMMA1001615, NT2RM1000789, NT2RM2000632,
NT2RM2000773, NT2RM4000326, NT2RP1000271, NT2RP1000468, NT2RP2000092, NT2RP2000610,
NT2RP2000712, NT2RP2000739, NT2RP2001538, NT2RP2001662, NT2RP2001817, NT2RP2001948,
NT2RP2002564, NT2RP2002974, NT2RP2003138, NT2RP2003302, NT2RP2003940, NT2RP2004108,
NT2RP2004847, NT2RP2005247, NT2RP2005391, NT2RP2005535, NT2RP2005774, NT2RP2005941,
NT2RP2006092, NT2RP3000148, NT2RP3000232, NT2RP3000378, NT2RP3000652, NT2RP3001976,
NT2RP3004090, NT2RP3004119, NT2RP3004294, OVARC1001049, OVARC1001086, OVARC1001132,
OVARC1001807, PLACE1000258, PLACE1000442, PLACE1000907, PLACE1003529, PLACE1004166,
PLACE1004168, PLACE1004887, PLACE1005250, PLACE1005682, PLACE1006079, PLACE1008549,
PLACE1011407, PLACE1011978, THYRO1000580, Y79AA1000030, Y79AA1001090, Y79AA1001523,
Y79AA1002334, Y79AA1002378, HEMBB1002302

[0039] The following 85 clones were categorized into disease-associated proteins. The clones categorized into
disease-associated proteins are those which matched the full-length sequences of the Swiss-Prot database with the keywords
"disease mutation" or "syndrome"; those which matched data suggesting that the proteins are disease-associated
proteins; or those which matched the full-length sequences of the Swiss-Prot database and the GenBank or UniGene database
where the matched sequences are those of genes or proteins which had been deposited in the database of Online
Mendelian Inheritance in Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), which is a database of human genes and
diseases.

BNGH41000020, HEMBA1000349, HEMBA1000590, HEMBA1000671, HEMBA1000835, HEMBA1001184,
HEMBA1001228, HEMBA1001886, HEMBA1003120, HEMBA1004250, HEMBA1005246, HEMBA1005267,
HEMBA1006707, HEMBA1006749, HEMBA1006902, HEMBA1006916, HEMBA1007013, HEMBB1002120,
MAMMA1000204, MAMMA1002080, NT2RM2000632, NT2RM2001126, NT2RM2001558, NT2RP1000271,
NT2RP1000465, NT2RP1000579, NT2RP2000447, NT2RP2000514, NT2RP2000739, NT2RP2001223,
NT2RP2001529, NT2RP2001562, NT2RP2002674, NT2RP2003369, NT2RP2004108, NT2RP2004205,
NT2RP2005535, NT2RP2005941, NT2RP2006004, NT2RP3000059, NT2RP3000125, NT2RP3000201,
NT2RP3000232, NT2RP3000616, NT2RP3000677, NT2RP3000838, NT2RP3000921, NT2RP3001542,
NT2RP3002286, NT2RP3002721, NT2RP3002737, NT2RP3002738, NT2RP3004481, OVARC1000208,
OVARC1000275, OVARC1000331, OVARC1000410, OVARC1001086, OVARC1001132, OVARC1001607,
OVARC1001725, OVARC1001952, PLACE1000258, PLACE1000442, PLACE1000907, PLACE1001100,
PLACE1001500, PLACE1002905, PLACE1002967, PLACE1003407, PLACE1003428, PLACE1005005,
PLACE1005239, PLACE1005815, PLACE1007028, PLACE1008716, PLACE1011407, PLACE1011978,
PLACE2000118, THYRO1000580, THYRO1000866, THYRO1001071, THYRO1001478, Y79AA1001062,
Y79AA1001530
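
The keyword-based part of the categorization described for the five categories above can be illustrated with a minimal sketch. It assumes that the description of each clone's matching database entry is available as a free-text string; the function name `categorize`, the dictionary layout and the sample annotation strings are hypothetical and are not part of the analysis actually performed. The domain-based predictions (PSORT, SOSUI, Pfam) and the OMIM lookup are not modelled here.

```python
# Hypothetical sketch: assign functional categories by keyword matching.
# The keyword lists are those quoted in the text; everything else
# (names, data layout, sample annotations) is an illustrative assumption.
CATEGORY_KEYWORDS = {
    "secretory/membrane": [
        "growth factor", "cytokine", "hormone", "signal", "transmembrane",
        "membrane", "extracellular matrix", "receptor",
        "g-protein coupled receptor", "ionic channel",
        "voltage-gated channel", "calcium channel", "cell adhesion",
        "collagen", "connective tissue",
    ],
    "glycoprotein-associated": ["glycoprotein"],
    "signal transduction-associated": [
        "serine/threonine-protein kinase", "tyrosine-protein kinase",
        "sh3 domain",
    ],
    "transcription-associated": [
        "transcription regulation", "zinc finger", "homeobox",
    ],
    "disease-associated": ["disease mutation", "syndrome"],
}


def categorize(annotation):
    """Return the sorted list of categories whose keywords occur in the annotation."""
    text = annotation.lower()
    return sorted(
        category
        for category, keywords in CATEGORY_KEYWORDS.items()
        if any(keyword in text for keyword in keywords)
    )


# Invented annotation strings, for illustration only.
print(categorize("Transmembrane glycoprotein receptor"))
print(categorize("zinc finger protein"))
print(categorize("hypothetical protein"))
```

Because the function returns every matching category, a single clone may fall into more than one category, which is consistent with the same clone names appearing in several of the lists above.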

[0040] From the analyses so far, it is unclear whether or not the remaining 212 clones encode proteins belonging to
any of the categories of secretory and/or membrane proteins, glycoprotein-associated proteins, signal transduction-associated
proteins, transcription-associated proteins or disease-associated proteins. Nonetheless, it is still possible
for these clones to encode secretory and/or membrane proteins, glycoprotein-associated proteins, signal transduction-associated
proteins, transcription-associated proteins, or disease-associated proteins. On the other hand, some of
these clones can be presumed to have functions other than those of secretory and/or membrane proteins, glycoprotein-associated
proteins, signal transduction-associated proteins, transcription-associated proteins and disease-associated
proteins.

[0041] Among the 212 clones, the following clones presumably belong to the categories of enzymes and/or metabolism-associated
proteins, cell division- and/or cell proliferation-associated proteins, cytoskeleton-associated proteins,
nuclear proteins, DNA- and/or RNA-binding proteins, ATP- and/or GTP-binding proteins, protein synthesis- and/or protein
transport-associated proteins, or cellular defense-associated proteins, although it is unclear whether or not the
clones belong to any of the categories of secretory and/or transmembrane proteins, glycoprotein-associated proteins,
signal transduction-associated proteins, transcription-associated proteins, and disease-associated proteins.

[0042] The following 10 clones presumably belong to the category of enzymes and/or metabolism-associated proteins.
The clones herein defined as clones presumably belonging to the category of enzymes and/or metabolism-associated
proteins are those which matched data containing keywords such as "metabolism", "oxidoreductase" and "E.C. No. (Enzyme
commission number)".

HEMBA1003315, HEMBB1002465, MAMMA1000614, NT2RP2000178, NT2RP2001388, NT2RP2001903,
NT2RP2002304, NT2RP2005878, NT2RP3001685, PLACE1006219

[0043] The following 4 clones presumably belong to the category of cell division- and/or cell proliferation-associated
proteins. The cDNA clones herein defined as clones presumably belonging to the category of cell division- and/or
cell proliferation-associated proteins are those which matched data containing keywords such as "cell division",
"cell cycle", "mitosis", "chromosomal protein", "cell growth" and "apoptosis".

MAMMA1000403, NT2RM2000497, NT2RP2000394, Y79AA1002121

[0044] The following 6 clones presumably belong to the category of cytoskeleton-associated proteins. The cDNA
clones herein defined as clones presumably belonging to the category of cytoskeleton-associated proteins
are those which matched data containing keywords such as "structural protein", "cytoskeleton", "actin-binding" and "microtubules".

MAMMA1001609, NT2RM2000589, NT2RP3000063, PLACE1004078, PLACE1004492, PLACE1008657

[0045] The following 7 clones presumably belong to the category of nuclear proteins. The cDNA clones herein
defined as clones presumably belonging to the category of nuclear proteins are those which matched data containing
keywords such as "nuclear protein".

HEMBA1001878, HEMBA1002992, MAMMA1000614, NT2RM4000965, NT2RM2001738, NT2RP2001388,
Y79AA1002121

[0046] The following 5 clones presumably belong to the category of DNA- and/or RNA-binding proteins. The cDNA
clones herein defined as clones presumably belonging to the category of DNA- and/or RNA-binding proteins
are those which matched data containing keywords such as "DNA-binding" and "RNA-binding".

HEMBA1003072, HEMBA1006770, HEMBA1007332, NT2RM2000497, Y79AA1002121

[0047] The following 7 clones presumably belong to the category of ATP- and/or GTP-binding proteins. The cDNA
clones herein defined as clones presumably belonging to the category of ATP- and/or GTP-binding proteins
are those which matched data containing keywords such as "ATP-binding" and "GTP-binding".

HEMBA1002316, MAMMA1001609, NT2RM2000306, NT2RM2000497, NT2RP2000178, NT2RP3003729,
PLACE1004305

[0048] The following 7 clones presumably belong to the category of protein synthesis- and/or protein transport-associated
proteins. The cDNA clones herein defined as clones presumably belonging to the category of protein
synthesis-associated and/or protein transport-associated proteins are those which matched data containing keywords
such as "translation regulation", "protein biosynthesis", "amino-acid biosynthesis", "ribosomal protein", "protein transport"
and "signal recognition particle".

NT2RM4000965, NT2RP2005069, NT2RP3000481, NT2RP3000789, NT2RP4001877, OVARC1001833,
OVARC1002058

[0049] The following 1 clone presumably belongs to the category of cellular defense-associated proteins. The cDNA
clones herein defined as clones presumably belonging to the category of cellular defense-associated proteins
are those which matched data containing keywords such as "heat shock", "DNA repair" and "DNA damage".

PLACE1005539

[0050] Although it is unclear whether or not 26 of the 174 clones other than the above-mentioned clones belong to
any of the above-described categories, these clones are predicted to have some function, based on the homology
search using their full-length sequences. The clone names and the gene definitions found in the results of the
homology search are shown below, separated by a double-slash mark, //.

HEMBA1000634//Homo sapiens T-cell activation protein (PGR1) gene, complete cds.
HEMBA1002524//Human MHC Class I region proline rich protein mRNA, complete cds.
HEMBA1003399//MVP1 PROTEIN.
HEMBA1005489//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete cds.
HEMBB1000542//Mus musculus bromodomain-containing protein BP75 mRNA, complete cds.
MAMMA1000788//Bos taurus P14 (p14) mRNA, complete cds.
MAMMA1002128//ABC1 PROTEIN HOMOLOG PRECURSOR.
NT2RM2000514//Homo sapiens F-box protein Fbx21 (FBX21) mRNA, complete cds.
NT2RM2000622//Mus musculus F-box protein FBL10 mRNA, partial cds.
NT2RM4000100//Homo sapiens Leman coiled-coil protein (LCCP) mRNA, complete cds.
NT2RP2005425//Homo sapiens mRNA for AKAP450 protein.
NT2RP3001170//Mus musculus activity-dependent neuroprotective protein (Adnp) mRNA, complete cds.
NT2RP3002571//Bos taurus mRNA for lyncein.
NT2RP3004557//Human Ki nuclear autoantigen mRNA, complete cds.
OVARC1001596//Homo sapiens Arf-like 2 binding protein BART1 mRNA, complete cds.
PLACE1002153//Homo sapiens TACC2 protein (TACC2) mRNA, partial cds.
PLACE1003163//Homo sapiens DBI-related protein mRNA, complete cds.
PLACE1005736//Human mRNA for BAS-GRIP protein.
PLACE1007702//Mus musculus TRA1 mRNA, complete cds.
PLACE1011045//Homo sapiens E1-like protein mRNA, complete cds.
THYRO1000061//Mus musculus mRNA for UBE-1c1, UBE-1c2, UBE-1c3, complete cds.
THYRO1000964//Drosophila melanogaster Pelle associated protein Pellino (Pli) mRNA, complete cds.
Y79AA1000776//Mus musculus mRNA for GSG1, complete cds.
Y79AA1001056//Homo sapiens MAID protein mRNA, complete cds.
Y79AA1001272//Homo sapiens retinoic acid repressible protein (RARG-1) mRNA, complete cds.
Y79AA1001793//Mus musculus mRNA for GSG1, complete cds.
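
Records in the "clone name//gene definition" form used above are straightforward to split programmatically. The following is a minimal sketch under that assumption; the helper name `parse_hit` is hypothetical, and the two sample records are copied from the list above.

```python
def parse_hit(line):
    """Split a 'CLONE//definition' record at the first double slash."""
    clone, separator, definition = line.partition("//")
    if not separator:
        raise ValueError(f"no '//' separator in: {line!r}")
    return clone.strip(), definition.strip()


# Two records taken from the list above.
records = [
    "HEMBA1003399//MVP1 PROTEIN.",
    "Y79AA1000776//Mus musculus mRNA for GSG1, complete cds.",
]

# Map each clone name to the gene definition of its homology hit.
hits = dict(parse_hit(record) for record in records)
for clone, definition in hits.items():
    print(clone, "->", definition)
```

Splitting at the first `//` only keeps a definition intact even if the definition itself were to contain a double slash.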



[0051] So far, no useful information for presuming the functions is available for the remaining 148 clones, whose
names are listed below.



HEMBA1000275, HEMBA1000300, HEMBA1000443, HEMBA1000875, HEMBA1000907, HEMBA1001272,
HEMBA1001296, HEMBA1001563, HEMBA1002164, HEMBA1002239, HEMBA1002985, HEMBA1003294,
HEMBA1003487, HEMBA1004007, HEMBA1004067, HEMBA1004085, HEMBA1004952, HEMBA1004971,
HEMBA1005145, HEMBA1005430, HEMBA1005913, HEMBA1006016, HEMBA1006517, HEMBA1006544,
HEMBA1006912, HEMBA1007057, HEMBA1007063, HEMBA1007291, HEMBB1000276, HEMBB1000309,
HEMBB1000642, HEMBB1001200, HEMBB1001547, HEMBB1002039, HEMBB1002228, HEMBB1002663,
MAMMA1000046, MAMMA1000449, MAMMA1000528, MAMMA1000652, MAMMA1000706, MAMMA1000814,
MAMMA1001066, MAMMA1001284, MAMMA1001623, MAMMA1001634, MAMMA1001901, MAMMA1002087,
MAMMA1002205, MAMMA1002224, NT2RM2000582, NT2RM2001643, NT2RM4000115, NT2RM4000295,
NT2RM4001321, NT2RP1000002, NT2RP1000239, NT2RP1000679, NT2RP1000740, NT2RP1000903,
NT2RP2000240, NT2RP2001878, NT2RP2001921, NT2RP2002015, NT2RP2002409, NT2RP2002510,
NT2RP2003599, NT2RP2003931, NT2RP2004069, NT2RP2004141, NT2RP2004447, NT2RP2004837,
NT2RP2005514, NT2RP2005632, NT2RP2005887, NT2RP2006099, NT2RP2006134, NT2RP3000427,
NT2RP3000444, NT2RP3000645, NT2RP3000871, NT2RP3001044, NT2RP3001061, NT2RP3001754,
NT2RP3002281, NT2RP3002324, NT2RP3002353, NT2RP3002409, NT2RP3002448, NT2RP3002664,
NT2RP3002887, NT2RP3002983, NT2RP3003448, NT2RP3003469, NT2RP3003473, NT2RP3003559,
NT2RP3003963, NT2RP3004000, NT2RP3004202, NT2RP3004321,
NT2RP3004355, NT2RP3004374, NT2RP4002715, OVARC1000090, OVARC1000137, OVARC1000467,
OVARC1000775, OVARC1000853, OVARC1000995, OVARC1001222, OVARC1001260, OVARC1001727,
OVARC1002178, PLACE1000986, PLACE1001114, PLACE1001229, PLACE1001788, PLACE1003438,
PLACE1003460, PLACE1003644, PLACE1004028, PLACE1004199, PLACE1004519, PLACE1005601,
PLACE1005669, PLACE1005768, PLACE1006515, PLACE1006786, PLACE1007040, PLACE1007077,
PLACE1007591, PLACE1007971, PLACE1008984, PLACE1009735, PLACE2000219, PLACE4000455,
THYRO1000846, THYRO1000999, THYRO1001063, THYRO1001128, THYRO1001471, THYRO1001495,
THYRO1001608, THYRO1001803, Y79AA1000127, Y79AA1000750, Y79AA1001592, Y79AA1001863
[0052] Of the 437 clones categorized into secretory and/or membrane proteins by using their full-length sequences,
410 clones were also predicted to encode secretory and/or membrane proteins by using
their partial nucleotide sequences. Of the 146 clones categorized into glycoprotein-associated proteins by using their
full-length sequences, 124 clones were also predicted to encode glycoprotein-associated
proteins by using their partial nucleotide sequences. Of the 57 clones categorized into signal transduction-associated
proteins by using their full-length sequences, 46 clones were also predicted to encode
signal transduction-associated proteins by using their partial nucleotide sequences. Of the 81 clones categorized into
transcription-associated proteins by using their full-length sequences, 57 clones were also predicted to encode
transcription-associated proteins by using their partial nucleotide sequences. Of the 85 clones
categorized into disease-associated proteins by using their full-length sequences, 6 clones were also predicted to encode
disease-associated proteins by using their partial nucleotide sequences. The number of
clones predicted to encode disease-associated proteins based on the full-length nucleotide sequences
is much greater than the number predicted based on the partial sequences. The reason is that the full-length sequences were
categorized into the category of disease-associated proteins by using the data found in the OMIM database.
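
The agreement between the two kinds of prediction can be expressed as a percentage per category. The following is a minimal sketch that uses only the clone counts stated above; the dictionary layout and the helper name `agreement` are illustrative assumptions.

```python
# (clones categorized from full-length sequences,
#  of which also predicted from partial sequences), as stated in the text
counts = {
    "secretory and/or membrane": (437, 410),
    "glycoprotein-associated": (146, 124),
    "signal transduction-associated": (57, 46),
    "transcription-associated": (81, 57),
    "disease-associated": (85, 6),
}


def agreement(full, partial):
    """Percentage of full-length-categorized clones also found from partial sequences."""
    return 100.0 * partial / full


rates = {name: round(agreement(full, partial), 1)
         for name, (full, partial) in counts.items()}
for name, percent in rates.items():
    print(f"{name}: {percent}%")
```

On these counts the agreement ranges from roughly 94% for secretory and/or membrane proteins down to roughly 7% for disease-associated proteins, which reflects the use of OMIM data for the full-length categorization only.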
[0053] In some cases, the predicted functions based on the partial sequences are different from those based on the 
full-length sequences. The reason is that a protein does not always belong solely to a single category of the above- 
described functional categories, and therefore, it is possible for the protein to belong to both of the predicted functional 
categories. Besides, additional functions can be found for the clones classified into these functional categories by 
further analyses. 

[0054] The following list shows the cDNA clones predicted and selected on the basis of the partial sequences (5'
sequences) as cDNAs encoding secretory and/or membrane proteins, glycoprotein-associated proteins, signal
transduction-associated proteins, transcription-associated proteins, or disease-associated proteins.
[0055] The clones selected by the ATGpr score and by PSORT for the existence of a signal
sequence can be expected to encode a secretory or membrane protein, since they are predicted to possess a secretion
signal or a transmembrane region. The clones selected by the ATGpr score and by PSORT for the
existence of a signal sequence are listed below (254 clones).

HEMBA1000300 HEMBA1000713 HEMBA1000907
HEMBA1000962 HEMBA1001272 HEMBA1001297
HEMBA1002164 HEMBA1002239 HEMBA1002420
HEMBA1002421 HEMBA1003101 HEMBA1003294
HEMBA1003399 HEMBA1003602 HEMBA1003732
HEMBA1004110 HEMBA1004797 HEMBA1005430
HEMBA1006016 HEMBA1006171 HEMBA1006311
HEMBA1006335 HEMBA1006357 HEMBA1006572
HEMBA1006658 HEMBA1006707 HEMBA1006902
HEMBA1006960 HEMBA1007013 HEMBB1000276
HEMBB1000447 HEMBB1000567 HEMBB1000642
HEMBB1000905 HEMBB1001200 HEMBB1001407
HEMBB1001530 HEMBB1001547 HEMBB1001978
HEMBB1002162 HEMBB1002228 HEMBB1002245
HEMBB1002427 HEMBB1002465 HEMBB1002663
HEMBB1002693 MAMMA1000046 MAMMA1000102
MAMMA1000118 MAMMA1000141 MAMMA1000449
MAMMA1000457 MAMMA1000591 MAMMA1000652
MAMMA1000681 MAMMA1000986 MAMMA1000994
MAMMA1001043 MAMMA1001141 MAMMA1001284
MAMMA1001310 MAMMA1001344 MAMMA1001893
MAMMA1001901 MAMMA1001957 MAMMA1002070
MAMMA1002087 MAMMA1002165 MAMMA1002205
MAMMA1002224 MAMMA1002633 NT2RM2000241
NT2RM2000306 NT2RM2000410 NT2RM2000514
NT2RM2001643 NT2RM2001941 NT2RM4000115
NT2RM4000997 NT2RM4001321 NT2RM4001325
NT2RM4001768 NT2RP1000050 NT2RP1000448
NT2RP1000903 NT2RP1001563 NT2RP2000479
NT2RP2001495 NT2RP2001915 NT2RP2001948
NT2RP2002015 NT2RP2002063 NT2RP2002304
NT2RP2002674 NT2RP2002721 NT2RP2003383
NT2RP2003469 NT2RP2003593 NT2RP2003599
NT2RP2003655 NT2RP2003664 NT2RP2004179
NT2RP2004447 NT2RP2004495 NT2RP2004524
NT2RP2004556 NT2RP2004837 NT2RP2005027
NT2RP2005463 NT2RP2005514 NT2RP2005887
NT2RP2006042 NT2RP2006269 NT2RP3000169
NT2RP3000460 NT2RP3000481 NT2RP3000645
NT2RP3000789 NT2RP3000818 NT2RP3001012
NT2RP3001044 NT2RP3001195 NT2RP3001560
NT2RP3001685 NT2RP3001858 NT2RP3002160
NT2RP3002281 NT2RP3002721 NT2RP3002836
NT2RP3002958 NT2RP3003076 NT2RP3003354
NT2RP3003469 NT2RP3003535 NT2RP3003559
NT2RP3003963 NT2RP3004000 NT2RP3004083
NT2RP3004133 NT2RP3004309 NT2RP3004321
NT2RP3004355 NT2RP3004374 NT2RP4001001
NT2RP4001879 NT2RP4002451 NT2RP4002715
OVARC1000208 OVARC1000298 OVARC1000439
OVARC1000775 OVARC1000811 OVARC1000853
OVARC1001222 OVARC1001727 OVARC1001807
OVARC1001833 PLACE1000231 PLACE1000560
PLACE1000740 PLACE1000912 PLACE1000914
PLACE1000927 PLACE1000986 PLACE1001100
PLACE1001183 PLACE1001229 PLACE1001407
PLACE1001536 PLACE1001788 PLACE1002080
PLACE1002095 PLACE1002374 PLACE1002518
PLACE1003407 PLACE1003428 PLACE1003460
PLACE1003839 PLACE1003845 PLACE1004028
PLACE1004199 PLACE1004282 PLACE1004305
PLACE1004482 PLACE1004637 PLACE1005005
PLACE1005250 PLACE1005383 PLACE1005410
PLACE1005544 PLACE1005569 PLACE1005601
PLACE1005660 PLACE1005669 PLACE1005725
PLACE1005768 PLACE1005927 PLACE1006079
PLACE1006093 PLACE1006219 PLACE1006277
PLACE1006443 PLACE1006786 PLACE1006809
PLACE1007040 PLACE1007096 PLACE1007296
PLACE1007626 PLACE1007971 PLACE1008469
PLACE1008984 PLACE1008985 PLACE1009067
PLACE1009196 PLACE1009527 PLACE1009982
PLACE1010251 PLACE1011236 PLACE2000219
PLACE4000455 SKNMC1000004 SKNMC1000014
THYRO1000036 THYRO1000099 THYRO1000196
THYRO1000795 THYRO1000999 THYRO1001237
THYRO1001327 THYRO1001478 THYRO1001495
THYRO1001523 THYRO1001702 THYRO1001725
Y79AA1000226 Y79AA1000270 Y79AA1000426
Y79AA1000521 Y79AA1000776 Y79AA1000959
Y79AA1001013 Y79AA1001056 Y79AA1001264
Y79AA1001328 Y79AA1001427 Y79AA1001430
Y79AA1001530 Y79AA1001592 Y79AA1001793
Y79AA1001795 Y79AA1001803 Y79AA1001863
Y79AA1002022 Y79AA1002373

[0056] In the example described below, the 254 clones listed above were categorized into three groups according to
their maximal ATGpr score and the PSORT result; they are shown in Tables 7-10, 11, and 12 (246 clones) and in
Tables 13, 14, and 15 (8 clones). In the tables, "name of clone" indicates the clone that was selected by the ATGpr
and the PSORT; "name of sequence" indicates the 5'-end sequence of the clone on the left; "maximal ATGpr score"
indicates the maximal ATGpr1 score of the 5'-end sequence on the left; and "signal" indicates the presence of a
signal sequence as predicted by the PSORT. In addition, the "representative sequence" is the longest sequence in
the cluster that includes the 5'-end sequence on the left. The "maximal ATGpr score" and "signal" entries on the
right indicate the maximal ATGpr1 score of the representative sequence and the presence of a signal sequence in the
representative sequence as predicted by the PSORT, respectively. The 170 clones shown in Tables 7-10, which have a
maximal ATGpr1 score higher than 0.5 and are predicted by the PSORT to possess a signal sequence, are very likely to
be full-length and to encode a secretory or membrane protein. The 35 clones in Table 11, which have a maximal ATGpr1
score of 0.3 or higher and less than 0.5 and are predicted to have a signal sequence, are likewise likely to be
full-length and to encode a secretory or membrane protein. The 41 clones in Table 12, which have a maximal ATGpr1
score of 0 or higher and less than 0.3 and are predicted to have a signal sequence, are also likely to be
full-length and to encode a secretory or membrane protein.

[0057] The 8 clones in Table 13 (4 clones), Table 14 (2 clones), and Table 15 (2 clones) have a maximal ATGpr1 score
of 0.5 or higher, 0.3 or higher and less than 0.5, and 0 or higher and less than 0.3, respectively, and are
predicted by the PSORT to have no signal sequence. However, the representative sequence of the cluster to which each
of these clones belongs contains a region that the PSORT recognizes as a signal sequence. These clones were
therefore judged to be full-length clones that encode, in particular, a membrane protein.
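
The grouping described in paragraphs [0056] and [0057] can be sketched as a small decision function. This is only an illustrative sketch: the function and parameter names are hypothetical, and it assumes that a score of exactly 0.5 falls in the top band (paragraph [0056] says "higher than 0.5", whereas paragraph [0057] says "0.5 or higher").

```python
from typing import Optional


def score_band(max_atgpr1: float) -> str:
    """Return the score band for a maximal ATGpr1 score."""
    if max_atgpr1 < 0:
        raise ValueError("the maximal ATGpr1 score is expected to be 0 or higher")
    if max_atgpr1 >= 0.5:  # assumption: 0.5 itself counts as the top band
        return "high"
    if max_atgpr1 >= 0.3:
        return "middle"
    return "low"


def assign_table(max_atgpr1: float,
                 clone_has_signal: bool,
                 representative_has_signal: bool) -> Optional[str]:
    """Map a clone to the table group named in the text, or None if not selected."""
    band = score_band(max_atgpr1)
    if clone_has_signal:
        # Signal sequence predicted in the 5'-end sequence of the clone itself.
        return {"high": "Tables 7-10", "middle": "Table 11", "low": "Table 12"}[band]
    if representative_has_signal:
        # Signal sequence found only in the representative sequence of the cluster.
        return {"high": "Table 13", "middle": "Table 14", "low": "Table 15"}[band]
    return None
```

For example, under these assumptions `assign_table(0.1, False, True)` places a clone in Table 15, while a clone with no predicted signal sequence in either sequence is not selected by this criterion.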

[0058] The clones selected by the ATGpr score and by the keywords in the top hit data in SwissProt are likely to
encode a secretory or membrane protein, or a protein with functions associated with signal transduction,
glycoproteins, transcription, or diseases, according to the respective keywords. These 659 clones are shown below.
Here, "top hit data" is defined as the data for the known amino acid sequence identified as the most homologous
sequence in a homology search using SwissProt.

BNGH41000020 BNGH41000087 BNGH41000091
HEMBA1000006 HEMBA1000121 HEMBA1000128
HEMBA1000275 HEMBA1000349 HEMBA1000443
HEMBA1000462 HEMBA1000477 HEMBA1000590
HEMBA1000634 HEMBA1000671 HEMBA1000732
HEMBA1000745 HEMBA1000835 HEMBA1000875
HEMBA1000907 HEMBA1000940 HEMBA1001184
HEMBA1001221 HEMBA1001228 HEMBA1001296
HEMBA1001390 HEMBA1001563 HEMBA1001621
HEMBA1001878 HEMBA1001886 HEMBA1002048
HEMBA1002131 HEMBA1002163 HEMBA1002164
HEMBA1002167 HEMBA1002178 HEMBA1002195
HEMBA1002227 HEMBA1002316 HEMBA1002421
HEMBA1002524 HEMBA1002551 HEMBA1002767
HEMBA1002985 HEMBA1002992 HEMBA1003047
HEMBA1003072 HEMBA1003101 HEMBA1003120
HEMBA1003230 HEMBA1003315 HEMBA1003392
HEMBA1003487 HEMBA1003497 HEMBA1003530
HEMBA1003945 HEMBA1004007 HEMBA1004067
HEMBA1004085 HEMBA1004250 HEMBA1004391
HEMBA1004444 HEMBA1004454 HEMBA1004505
HEMBA1004785 HEMBA1004797 HEMBA1004952
HEMBA1004971 HEMBA1004982 HEMBA1005070
HEMBA1005084 HEMBA1005145 HEMBA1005230
HEMBA1005246 HEMBA1005267 HEMBA1005337
HEMBA1005449 HEMBA1005489 HEMBA1005522
HEMBA1005545 HEMBA1005698 HEMBA1005913
HEMBA1005929 HEMBA1005945 HEMBA1006276
HEMBA1006299 HEMBA1006335 HEMBA1006430
HEMBA1006482 HEMBA1006517 HEMBA1006544
HEMBA1006572 HEMBA1006707 HEMBA1006724
HEMBA1006749 HEMBA1006770 HEMBA1006902
HEMBA1006912 HEMBA1006916 HEMBA1007013
HEMBA1007057 HEMBA1007063 HEMBA1007226
HEMBA1007241 HEMBA1007291 HEMBA1007332
HEMBB1000106 HEMBB1000309 HEMBB1000407
HEMBB1000447 HEMBB1000542 HEMBB1000567
HEMBB1000668 HEMBB1000679 HEMBB1000881
HEMBB1001026 HEMBB1001048 HEMBB1001200
HEMBB1001573 HEMBB1001847 HEMBB1001959
HEMBB1002039 HEMBB1002041 HEMBB1002051
HEMBB1002120 HEMBB1002302 HEMBB1002427
HEMBB1002661 MAMMA1000106 MAMMA1000204
MAMMA1000226 MAMMA1000403 MAMMA1000473
MAMMA1000496 MAMMA1000528 MAMMA1000591
MAMMA1000614 MAMMA1000681 MAMMA1000706
MAMMA1000788 MAMMA1000810 MAMMA1000814
MAMMA1000881 MAMMA1001043 MAMMA1001066
MAMMA1001094 MAMMA1001150 MAMMA1001237
MAMMA1001418 MAMMA1001532 MAMMA1001609
MAMMA1001615 MAMMA1001623 MAMMA1001634
MAMMA1001893 MAMMA1001957 MAMMA1001978
MAMMA1002070 MAMMA1002080 MAMMA1002091
MAMMA1002095 MAMMA1002128 MAMMA1002142
MAMMA1002165 MAMMA1002234 MAMMA1002586
MAMMA1002633 MAMMA1003126 NT2RM1000407
NT2RM1000462 NT2RM1000542 NT2RM1000580
NT2RM1000789 NT2RM1000855 NT2RM1000858
NT2RM1000899 NT2RM2000410 NT2RM2000423
NT2RM2000497 NT2RM2000565 NT2RM2000582
NT2RM2000589 NT2RM2000622 NT2RM2000632
NT2RM2000773 NT2RM2001126 NT2RM2001558
NT2RM2001626 NT2RM2001738 NT2RM2001767
NT2RM2001792 NT2RM2001818 NT2RM2001902
NT2RM2001939 NT2RM2001941 NT2RM4000100
NT2RM4000198 NT2RM4000284 NT2RM4000295
NT2RM4000326 NT2RM4000417 NT2RM4000444
NT2RM4000587 NT2RM4000593 NT2RM4000648
NT2RM4000761 NT2RM4000965 NT2RM4001377
NT2RM4001735 NT2RM4001843 NT2RM4002352
NT2RP1000002 NT2RP1000050 NT2RP1000181
NT2RP1000239 NT2RP1000261 NT2RP1000271
NT2RP1000300 NT2RP1000325 NT2RP1000465
NT2RP1000468 NT2RP1000551 NT2RP1000579
NT2RP1000613 NT2RP1000679 NT2RP1000740
NT2RP1000981 NT2RP1001004 NT2RP1001020
NT2RP1001031 NT2RP2000092 NT2RP2000178
NT2RP2000240 NT2RP2000394 NT2RP2000447
NT2RP2000514 NT2RP2000533 NT2RP2000610
NT2RP2000616 NT2RP2000649 NT2RP2000663
NT2RP2000694 NT2RP2000712 NT2RP2000739
NT2RP2000818 NT2RP2000903 NT2RP2001200
NT2RP2001223 NT2RP2001276 NT2RP2001388
NT2RP2001469 NT2RP2001480 NT2RP2001495
NT2RP2001514 NT2RP2001529 NT2RP2001538
NT2RP2001562 NT2RP2001662 NT2RP2001755
NT2RP2001769 NT2RP2001817 NT2RP2001878
NT2RP2001903 NT2RP2001921 NT2RP2001948
NT2RP2001956 NT2RP2002063 NT2RP2002188
NT2RP2002232 NT2RP2002304 NT2RP2002409
NT2RP2002510 NT2RP2002527 NT2RP2002533
NT2RP2002564 NT2RP2002824 NT2RP2002942
NT2RP2002974 NT2RP2002976 NT2RP2003042
NT2RP2003138 NT2RP2003179 NT2RP2003210
NT2RP2003302 NT2RP2003369 NT2RP2003390
NT2RP2003469 NT2RP2003545 NT2RP2003593
NT2RP2003655 NT2RP2003664 NT2RP2003931
NT2RP2003940 NT2RP2003950 NT2RP2004069
NT2RP2004108 NT2RP2004141 NT2RP2004205
NT2RP2004447 NT2RP2004606 NT2RP2004648
NT2RP2004670 NT2RP2004794 NT2RP2004847
NT2RP2005069 NT2RP2005163 NT2RP2005181
NT2RP2005247 NT2RP2005378 NT2RP2005391
NT2RP2005425 NT2RP2005535 NT2RP2005541
NT2RP2005597 NT2RP2005632 NT2RP2005666
NT2RP2005774 NT2RP2005878 NT2RP2005883
NT2RP2005941 NT2RP2005994 NT2RP2006004
NT2RP2006042 NT2RP2006092 NT2RP2006099
NT2RP2006134 NT2RP2006269 NT2RP2006512
NT2RP3000011 NT2RP3000022 NT2RP3000059
NT2RP3000063 NT2RP3000125 NT2RP3000148
NT2RP3000171 NT2RP3000172 NT2RP3000201
NT2RP3000232 NT2RP3000304 NT2RP3000378
NT2RP3000427 NT2RP3000436 NT2RP3000444
NT2RP3000481 NT2RP3000616 NT2RP3000645
NT2RP3000652 NT2RP3000676 NT2RP3000677
NT2RP3000721 NT2RP3000820 NT2RP3000838
NT2RP3000871 NT2RP3000907 NT2RP3000921
NT2RP3001012 NT2RP3001061 NT2RP3001159
NT2RP3001170 NT2RP3001195 NT2RP3001240
NT2RP3001271 NT2RP3001322 NT2RP3001388
NT2RP3001542 NT2RP3001560 NT2RP3001592
NT2RP3001650 NT2RP3001738 NT2RP3001754
NT2RP3001976 NT2RP3002015 NT2RP3002160
NT2RP3002286 NT2RP3002311 NT2RP3002324
NT2RP3002342 NT2RP3002353 NT2RP3002409
NT2RP3002411 NT2RP3002448 NT2RP3002571
NT2RP3002664 NT2RP3002737 NT2RP3002738
NT2RP3002790 NT2RP3002836 NT2RP3002887
NT2RP3002900 NT2RP3002958 NT2RP3002983
NT2RP3003000 NT2RP3003076 NT2RP3003354
NT2RP3003448 NT2RP3003473 NT2RP3003527
NT2RP3003532 NT2RP3003614 NT2RP3003729
NT2RP3003849 NT2RP3003874 NT2RP3003939
NT2RP3004025 NT2RP3004067 NT2RP3004075
NT2RP3004090 NT2RP3004119 NT2RP3004130
NT2RP3004133 NT2RP3004202 NT2RP3004294
NT2RP3004309 NT2RP3004345 NT2RP3004406
NT2RP3004481 NT2RP3004552 NT2RP3004557
NT2RP3004625 NT2RP3004640 NT2RP3004647
NT2RP4000108 NT2RP4000634 NT2RP4000962
NT2RP4001009 NT2RP4001467 NT2RP4001877
NT2RP4001879 NT2RP4002187 NT2RP4002451
NT2RP4002750 OVARC1000003 OVARC1000090
OVARC1000105 OVARC1000137 OVARC1000255
OVARC1000275 OVARC1000307 OVARC1000313
OVARC1000331 OVARC1000410 OVARC1000439
OVARC1000467 OVARC1000529 OVARC1000553
OVARC1000873 OVARC1000916 OVARC1000956
OVARC1000995 OVARC1001030 OVARC1001049
OVARC1001086 OVARC1001132 OVARC1001163
OVARC1001222 OVARC1001260 OVARC1001336
OVARC1001338 OVARC1001569 OVARC1001570
OVARC1001596 OVARC1001607 OVARC1001725
OVARC1001952 OVARC1001991 OVARC1002058
OVARC1002178 PLACE1000033 PLACE1000258
PLACE1000442 PLACE1000740 PLACE1000907
PLACE1001016 PLACE1001114 PLACE1001123
PLACE1001231 PLACE1001340 PLACE1001401
PLACE1001407 PLACE1001464 PLACE1001500
PLACE1001516 PLACE1001564 PLACE1001655
PLACE1001795 PLACE1001836 PLACE1001918
PLACE1001949 PLACE1002080 PLACE1002095
PLACE1002153 PLACE1002329 PLACE1002355
PLACE1002374 PLACE1002547 PLACE1002726
PLACE1002905 PLACE1002911 PLACE1002967
PLACE1003135 PLACE1003163 PLACE1003428
PLACE1003438 PLACE1003460 PLACE1003529
PLACE1003573 PLACE1003598 PLACE1003644
PLACE1003737 PLACE1003772 PLACE1003852
PLACE1004078 PLACE1004166 PLACE1004168
PLACE1004279 PLACE1004441 PLACE1004450
PLACE1004482 PLACE1004492 PLACE1004519
PLACE1004520 PLACE1004630 PLACE1004648
PLACE1004816 PLACE1004887 PLACE1005003
PLACE1005031 PLACE1005239 PLACE1005383
PLACE1005426 PLACE1005519 PLACE1005539
PLACE1005544 PLACE1005569 PLACE1005682
PLACE1005736 PLACE1005745 PLACE1005815
PLACE1005878 PLACE1005927 PLACE1006071
PLACE1006073 PLACE1006208 PLACE1006277
PLACE1006290 PLACE1006443 PLACE1006515
PLACE1006716 PLACE1006959 PLACE1007028
PLACE1007077 PLACE1007081 PLACE1007096
PLACE1007296 PLACE1007591 PLACE1007702
PLACE1007845 PLACE1007881 PLACE1008282
PLACE1008297 PLACE1008359 PLACE1008469
PLACE1008549 PLACE1008657 PLACE1008716
PLACE1008744 PLACE1008984 PLACE1008985
PLACE1009279 PLACE1009527 PLACE1009546
PLACE1009600 PLACE1009735 PLACE1010011
PLACE1010078 PLACE1010081 PLACE1010251
PLACE1010445 PLACE1010713 PLACE1010784
PLACE1010827 PLACE1010968 PLACE1011045
PLACE1011116 PLACE1011181 PLACE1011236
PLACE1011364 PLACE1011407 PLACE1011516
PLACE1011708 PLACE1011824 PLACE1011978
PLACE2000118 PLACE3000181 PLACE3000213
PLACE4000354 SKNMC1000014 SKNMC1000082
THYRO1000061 THYRO1000196 THYRO1000400
THYRO1000580 THYRO1000584 THYRO1000678
THYRO1000776 THYRO1000795 THYRO1000846
THYRO1000866 THYRO1000956 THYRO1000964
THYRO1001063 THYRO1001071 THYRO1001102
THYRO1001113 THYRO1001128 THYRO1001205
THYRO1001242 THYRO1001266 THYRO1001456
THYRO1001457 THYRO1001471 THYRO1001478
THYRO1001529 THYRO1001593 THYRO1001608
THYRO1001641 THYRO1001700 THYRO1001702
THYRO1001770 THYRO1001803 Y79AA1000030
Y79AA1000127 Y79AA1000207 Y79AA1000270
Y79AA1000426 Y79AA1000750 Y79AA1000777
Y79AA1000876 Y79AA1000888 Y79AA1000967
Y79AA1001062 Y79AA1001090 Y79AA1001212
Y79AA1001272 Y79AA1001426 Y79AA1001523
Y79AA1001727 Y79AA1001787 Y79AA1001799
Y79AA1001803 Y79AA1001863 Y79AA1002058
Y79AA1002121 Y79AA1002129 Y79AA1002213
Y79AA1002334 Y79AA1002376 Y79AA1002378
Y79AA1002381 NT2RP2006580



[0059] Among these clones, the following 83 are identical to clones selected by the ATGpr score and the PSORT
prediction of a signal sequence.



HEMBA1000907 NT2RM2000410 PLACE1000740
HEMBA1002164 NT2RM2001941 PLACE1001407
HEMBA1002421 NT2RP1000050 PLACE1002080
HEMBA1003101 NT2RP2001495 PLACE1002095
HEMBA1004797 NT2RP2001948 PLACE1002374
HEMBA1006335 NT2RP2002063 PLACE1003428
HEMBA1006572 NT2RP2002304 PLACE1003460
HEMBA1006707 NT2RP2003469 PLACE1004482
HEMBA1006902 NT2RP2003593 PLACE1005383
HEMBA1007013 NT2RP2003655 PLACE1005544
HEMBB1000447 NT2RP2003664 PLACE1005569
HEMBB1000567 NT2RP2004447 PLACE1005927
HEMBB1001200 NT2RP2006042 PLACE1006277
HEMBB1002427 NT2RP2006269 PLACE1006443
MAMMA1000591 NT2RP3000481 PLACE1007096
MAMMA1000681 NT2RP3000645 PLACE1007296
MAMMA1001043 NT2RP3001012 PLACE1008469
MAMMA1001893 NT2RP3001195 PLACE1008984
MAMMA1001957 NT2RP3001560 PLACE1008985
MAMMA1002070 NT2RP3002160 PLACE1009527
MAMMA1002165 NT2RP3002836 PLACE1010251
MAMMA1002633 NT2RP3002958 PLACE1011236
             NT2RP3003076 SKNMC1000014
             NT2RP3003354 THYRO1000196
             NT2RP3004133 THYRO1000795
             NT2RP3004309 THYRO1001478
             NT2RP4001879 THYRO1001702
             NT2RP4002451 Y79AA1000270
             OVARC1000439 Y79AA1000426
             OVARC1001222 Y79AA1001803
                          Y79AA1001863
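
The overlap between the two selections is a set intersection of clone names. The sketch below uses only a handful of clone names taken from the two lists above, so it illustrates the operation rather than reproducing the full 83-clone result; the variable names are hypothetical.

```python
# Small excerpts of the two selections (not the full lists).
selected_by_signal = {"HEMBA1000300", "HEMBA1000907", "HEMBA1002164", "Y79AA1002373"}
selected_by_keyword = {"HEMBA1000006", "HEMBA1000907", "HEMBA1002164", "Y79AA1002381"}

# Clones present in both selections, sorted for stable display.
overlap = sorted(selected_by_signal & selected_by_keyword)
print(overlap)
```

Applied to the complete lists, the same intersection would be expected to yield the 83 clones stated in paragraph [0059].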



[0060] The 446 clones in Tables 16, 17, 18, 19, and 20, together with NT2RP2006580, are predicted to encode a
secretory or membrane protein. Among them, 77 clones were identical to clones selected by the ATGpr score and the
PSORT prediction of a signal sequence (that is, they overlap with the 254 clones listed in Tables 7-15). In
addition, many clones turned out to be identical to clones selected as encoding a glycoprotein-associated protein,
and some were identical to clones selected as encoding a disease-associated protein.

[0061] The 243 clones in Table 21 are predicted to encode a glycoprotein. Among them, 53 clones were identical to
those selected by the ATGpr score and the PSORT prediction of a signal sequence. Many clones also turned out to be
identical to clones selected as encoding a secretory or membrane protein, and some were identical to clones
selected as encoding a disease-associated protein.

[0062] The 51 clones in Table 22 are predicted to encode a protein associated with signal transduction.

[0063] The 130 clones in Table 23 are predicted to encode a protein associated with transcription.

[0064] The 17 clones in Table 24 are predicted to encode a protein associated with diseases.

[0065] Of these clones, 532 have a maximal ATGpr1 score of 0.5 or higher (Table 25), 60 have a maximal ATGpr1 score
of 0.3 or higher and less than 0.5 (Table 26 and NT2RP2006580), and 67 have a maximal ATGpr1 score of 0 or higher
and less than 0.3 (Table 27).
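
The clone counts quoted in this section are mutually consistent, which can be checked with simple arithmetic. The sketch below only restates figures given in the text; the variable names are hypothetical.

```python
# Signal-sequence selection: Tables 7-10, 11, and 12
# (signal sequence predicted in the clone itself).
with_signal = 170 + 35 + 41
# Tables 13, 14, and 15 (signal sequence found only in the representative sequence).
representative_only = 4 + 2 + 2
signal_total = with_signal + representative_only

# Keyword selection by score band: Table 25, Table 26 plus NT2RP2006580, Table 27.
keyword_total = 532 + (59 + 1) + 67

print(signal_total, keyword_total)
```

The two totals correspond to the 254 clones of the signal-sequence selection and the 659 clones of the keyword selection.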

[0066] The 532 clones shown in Table 25, each having a maximal ATGpr1 score of 0.5 or higher, are very likely to be
full-length and to encode a secretory or membrane protein, or a protein associated with signal transduction,
glycoproteins, transcription, or diseases. The 59 clones in Table 26 and NT2RP2006580, which have a maximal ATGpr1
score of 0.3 or higher and less than 0.5, are likely to be full-length and to encode a secretory or membrane
protein, or a protein associated with signal transduction, glycoproteins, transcription, or diseases. The 67 clones
in Table 27, which have a maximal ATGpr1 score of 0 or higher and less than 0.3, are still likely to be full-length
and to encode a secretory or membrane protein, or a protein associated with signal transduction, glycoproteins,
transcription, or diseases.

[0067] The above is the method for selecting, on the basis of partial sequences (5'-end sequences), the cDNA clones
predicted to encode secretory and/or transmembrane proteins, glycoprotein-associated proteins, signal
transduction-associated proteins, transcription-associated proteins, or disease-associated proteins.

[0068] The polynucleotide of the present invention encodes the amino acid sequence of a functional protein such as
a secretory or membrane protein, or a protein associated with signal transduction, glycoproteins, transcription, or
diseases. Because the polynucleotide encodes the complete amino acid sequence of the protein, the biological
activity of the protein can be analyzed by expressing it as a recombinant protein in an appropriate expression
system, or by raising and using an antibody that specifically recognizes it.

[0069] The biological activity of a secretory protein, a membrane protein, or a protein associated with signal
transduction, glycoproteins, or transcription can be analyzed based on the methods described in "Gene Transcription"
(Hames B.D. and Higgins S.J., eds. (1993)), "Glycobiology" (Fukuda M. and Kobata A., eds. (1993)), "Growth Factors"
(McKay I. and Leigh I., eds. (1993)), "Extracellular Matrix" (Haralson M.A. and Hassell J.R., eds. (1995)),
"Transcription Factors" (Latchman D.S., ed. (1993)), and "Signal Transduction" (Milligan G., ed. (1992)), published
in "The Practical Approach Series" (IRL Press), or in "Signal Transduction Protocols" (Kendall D.A. and Hill S.J.,
eds. (1995)) and "Glycoprotein Analysis in Biomedicine" (Hounsell E.F., ed. (1993)), published in "Methods in
Molecular Biology" (Humana Press).
[0070] For a protein associated with a disease, it is possible not only to perform a functional analysis as
described above, but also to analyze the correlation between the expression or activity of the protein and a
particular disease by using a specific antibody that recognizes the protein. Alternatively, the protein can be
analyzed using Online Mendelian Inheritance in Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), a database of human
genes and diseases. New information is constantly being deposited in the OMIM database. It is therefore possible
for one skilled in the art to find a new relationship between a particular disease and a gene of the present
invention in the updated database.

[0071] Proteins associated with diseases are useful in drug development because they can be utilized as a
diagnostic marker, in developing a drug that regulates their expression level or activity, or as a target of gene
therapy. Likewise, for secretory proteins, membrane proteins, and proteins associated with signal transduction,
glycoproteins, or transcription, a search of OMIM with the keywords listed below revealed that these proteins are
associated with many diseases. Relationships between diseases and proteins associated with signal transduction or
transcription are also reported in "Transcription Factor Research-1999" (Fujii, Tamura, Morohashi, Kageyama, and
Satake, eds. (1999) Jikken-Igaku Zoukan, Vol. 17, No. 3) and "Gene Medicine" ((1999) Vol. 3, No. 2). Thus, not only
disease-associated proteins but also secretory proteins, membrane proteins, and proteins associated with signal
transduction, glycoproteins, or transcription are involved in diseases, suggesting that these proteins are also very
important targets for the medical industry.
[0072] Keywords used in the search of the OMIM

(1) secretion protein
(2) membrane protein
(3) channel
(4) extracellular matrix
(5) receptor
(6) glycoprotein
(7) protein kinase
(8) calmodulin kinase
(9) transcription factor

[0073] Only the OMIM accession numbers are shown in the search results. Using an accession number, the data showing
the relationship between a disease and a gene or protein can be retrieved. The OMIM data are updated every day.

1) Secretion protein

268 entries found, searching for "secretion protein"

104760, 176860, 160900, 107400, 118910, 139320, 603850, 147572, 176880, 600946, 603215, 157147, 600174,
151675, 170280, 179512, 179513, 138120, 179509, 246700, 179510, 600626, 179511, 600998, 109270, 601489,
154545, 179490, 185860, 603216, 122559, 601746, 147290, 602672, 146770, 603062, 179508, 131230, 601591,
602421, 139250, 167805, 167770, 600041, 600564, 118825, 601146, 300090, 600753, 601652, 600759, 600768,
602434, 182590, 603166, 308230, 602534, 603489, 107470, 150390, 104610, 173120, 158106, 143890, 306900,
308700, 134797, 137350, 227500, 176300, 107730, 600760, 138079, 120180, 120160, 120150, 124092, 138160,
101000, 227600, 600509, 601199, 142410, 104311, 193400, 201910, 107300, 122560, 272800, 217000, 590050,
147670, 133170, 176730, 300300, 134370, 274600, 120140, 162151, 158070,

152790, 120120, 106100, 300200, 192340, 190160, 138040, 147470, 147620, 173350, 147380, 152200, 152760,
157145, 153450, 264080, 113811, 600937, 600840, 188545, 202110, 600514, 186590, 603372, 136435, 137241,
252800, 214500, 207750, 138850, 139191, 142640, 138130, 189907, 603692, 600633, 603355, 107270, 600377,
147892, 232200, 600281, 232800, 602358, 137035, 601771, 601769, 253200, 601933, 118444, 600270, 120700,
600945, 603732, 147660, 600761, 172400, 600823, 600877, 130080, 171060, 107740, 307800, 602843, 130660,
152780, 124020, 601124, 601340, 601604, 601610, 171050, 312060, 232700, 300159, 142703, 600734, 125255,
168450, 123812, 188540, 147940, 188450, 600839, 182452, 188400, 182280, 176760, 263200, 600264, 188826,
252650, 601185, 162641, 137216, 601398, 601538, 118888, 118445, 601745,

190180, 601922, 182098, 602008, 147440, 602384, 600031, 109160, 602663, 151670, 602682, 602730, 602779,
146880, 603061, 142704, 603140, 106150, 600732, 153620, 603318, 139392, 600042, 102200, 603493, 182100,
264300, 603795, 184600

2) Membrane protein 

1017 entries found, searching for "membrane protein" 

130500, 305360, 153330, 173610, 170995, 109270, 170993, 309060, 120920, 602333, 133740, 133710, 602690,
133730, 159430, 600897, 133090, 601178, 602413, 602003, 109280, 603237, 602173, 107776, 602334, 125305,
602335, 182879, 154045, 309845, 600594, 603718, 603241, 603214, 603657, 603177, 600182, 601476, 602879,
136950, 600723, 601114, 185880, 185881, 300096, 602257, 160900, 177070, 603062, 603344, 602977, 310200,
600959, 300100, 186945, 600039, 600267, 128240, 182900, 601097, 136430, 600946, 602534, 601047, 143450,
603141, 603700, 600579, 256540, 159440, 602414, 600403, 602048, 188860, 137290, 158343, 184756, 602910,
603179, 600279, 108733, 107770, 173335, 602625, 154050, 219800, 603850, 601028, 600447, 104225, 186946,
601767, 603143, 121015, 603215, 227400, 603735, 600179, 602421, 180721,

176801, 176860, 600753, 603142, 176790, 600266, 601239, 115501, 143890, 121014, 121011, 125950, 603534,
304040, 601134, 600754, 601510, 601595, 190315, 300172, 602216, 602261, 602262, 602461, 131560, 179514,
179512, 176981, 142461, 139310, 312080, 176640, 128239, 185470, 310300, 601403, 601757, 273800, 151460,
176943, 104311, 168468, 120130, 602887, 600164, 601531, 601832, 104775, 600040, 603583, 176894, 602631,
166945, 182180, 120620, 141180, 601014, 139150, 182860, 177061, 600174, 180069, 191275, 104760, 601693,
300017, 603518, 601009, 134651, 601107, 103868, 600168, 136425, 603531, 603291, 600917, 603216, 102720,
300118, 179590, 135630, 602285, 107450, 602296, 303630, 176878, 120090, 600322, 138160, 601212, 603293,
131230, 112205, 600763, 600718, 300187, 170715, 601966, 300051, 602474,

120070, 600691, 600855, 182309, 602101, 602857, 194355, 162230, 600874, 113730, 155550, 602701, 306400,
601789, 231200, 107271, 175100, 182870, 305100, 301000, 601313, 157147, 147670, 139200, 603593, 157655,
600934, 155970, 602049, 155960, 155760, 118990, 135620, 308230, 602694, 162060, 300023, 160993, 153619,
153432, 120131, 603823, 603167, 601023, 600816, 165040, 601681, 166490, 300112, 120190, 300145, 163970,
600772, 602926, 602933, 602202, 400015, 151510, 600759, 602672, 602654, 603821, 116952, 151430, 602632,
155975, 602217, 150370, 600752, 601179, 600932, 603048, 603234, 601805, 603822, 603869, 601717, 601181,
313440, 139130, 107777, 109190, 603452, 191163, 191164, 602370, 176877, 103195, 600523, 191328, 601275,
204200, 602426, 603820, 600551, 600695, 600552, 600553, 602306, 601523,

602507, 602299, 600583, 114070, 600632, 603498, 185430, 600587, 235200, 173470, 603199, 601633, 602500,
208900, 180297, 156225, 516020, 190195, 141900, 102680, 193300, 101000, 193400, 300011, 107400, 257220,
107741, 180380, 203200, 111700, 600024, 304800, 600065, 110750, 179605, 113705, 601638, 222900, 120120,
602509, 602469, 600930, 601383, 176261, 602574, 602997, 311770, 131550, 603616, 308700, 603372, 256100,
224100, 276903, 305900, 516000, 131195, 314555, 601567, 603866, 306900, 103390, 186720, 173850, 601050,
602505, 186590, 246530, 602689, 194380, 300041, 162643, 152790, 120150, 600682, 600106, 272750, 188040,
602382, 601497, 113811, 182138, 212138, 601309, 109690, 114760, 176805, 601253, 123900, 602581, 189980,
191190, 110700, 600163, 137167, 600580, 601610, 190000, 123825, 603491,

600135, 186591, 173910, 138140, 107266, 120950, 601081, 603690, 244400, 312700, 171060, 601199, 601758,
170500, 277900, 601997, 314850, 601880, 603009, 120220, 603126, 164920, 602934, 164730, 163890, 603434,
107269, 602909, 600877, 256550, 164761, 602872, 120110, 126150, 158070, 266200, 223360, 250800, 269920,
252650, 603355, 154582, 138190, 300035, 602640, 227650, 158120, 153700, 182380, 155740, 204500, 603401,
601975, 300135, 136350, 602924, 300167, 185050, 176100, 300189, 151525, 300200, 165180, 230800, 602158,
602676, 603411, 193245, 120325, 601848, 192500, 603102, 147795, 245900, 137060, 147557, 120650, 602377,
307800, 120930, 308100, 142800, 191092, 232300, 173510, 602225, 180470, 190930, 186357, 134638, 600544,
601373, 600509, 600359, 603784, 600395, 600653, 603754, 601597, 601066,

40 600185, 601295, 600978, 205400, 603274, 600418, 600839, 516050, 601691 , 601007, 600650, 600308, 603261, 

601 1 93, 600004, 60001 7, 51 6040, 253800, 276901 , 60001 9, 257200, 1 08780, 300037, 3001 04, 3001 26, 2551 25, 
203300, 300191, 426000, 302060, 304700, 201475, 252010, 193210, 311030, 306250, 248600, 191740, 108360, 
131244, 600423, 232200, 191305, 231680, 103320, 190180, 600493, 111200, 226200,312600, 600170, 111680, 
186910,203100,600536, 600238, 186830, 186760, 186745, 186711 , 106180, 111203, 103180, 182530, 182160, 

45 600644, 307030, 192321, 600667, 125647, 179080, 114207, 114860, 176000, 116930, 600748, 173515, 173325, 

600377, 171760, 171050, 118425, 170260, 191315, 600798, 600821 , 600823, 600444, 600840, 159465, 600857, 
158380, 600867, 154360, 152427, 150330, 110900, 147840, 147360, 147280, 

1 46880, 31 261 0, 1 20940, 1 42871 , 1 42790, 600937, 1 42600, 1 34390, 1 1 1 250, 600979, 600997, 1 42460, 1 86845, 
134635,601017, 139191 , 139090, 138850,601040, 138720, 122561 , 131100, 123610,217070, 100500,603377, 

50 602354, 603302, 603207, 603086, 602188, 602095, 603867, 603842, 603798, 602602, 601194, 602607, 603713, 

603681, 601252, 603648, 603646, 603644, 601282, 601284, 603667, 603712, 603594, 601872, 603425, 601843, 
603263, 603208, 601411, 603201, 603189, 601463, 603164, 603152, 603087, 602874, 601492, 602893, 602057, 
602859, 602746, 603879, 603510, 602458, 603380, 601581 , 603765, 603283, 601599, 601733, 601852, 602316, 
601615, 601617, 602184, 602894, 603005, 603030, 603861 , 602835, 602136, 600153, 600074, 600046, 600023, 

55 601 625, 51 6006, 60001 8, 60001 6, 51 6002, 601 590, 31 3475, 31 3470, 600244, 

600528, 601611 , 600282, 600327, 601568, 600368, 601730, 601535, 601745, 601929, 300169, 300150, 300132, 
601533, 600385, 600464, 600424, 600429, 601756, 601488, 516005,251100, 516004, 600918, 516003, 602192, 
51 6001 , 240500, 600465, 602241 , 602243, 230200, 601 485, 601 478, 601 41 6, 602297, 601 459, 601 839, 60231 4, 
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193065, 193001, 191306, 600504,601020, 191191,602372, 190181,600534, 188380, 186854,186360,600530, 
185250, 182331 , 600535, 182305, 601296, 600582, 600732, 600734, 600742, 600782, 176802, 176266, 600769, 
601 883, 600864, 601 901 , 1 76260, 1 73490, 60091 0, 601 905, 1 71 890, 60091 6, 601 987, 602679, 1 62651 ,161 555, 
160994, 602714, 602715, 602724, 602736, 300007, 602783, 275630, 602836, 270200, 602871 ,159460, 602876, 

5 154540, 153900, 602890, 601153, 602190, 602905, 153634, 153337, 602914, 

152310, 151690, 151625, 602935, 602974, 150325, 602992, 150320, 250790, 603006, 603007, 603008, 150292, 
233690, 603046, 150210, 603061 , 147940, 603063, 221770, 223100, 603097, 147880, 603118, 147730, 146928, 
146630, 142622, 603149, 603150, 603151, 600923, 138981, 138590, 138330, 216950, 603192, 138297, 603202, 
601 002, 602343, 1 38230, 1 361 31 , 60321 7, 603220, 1 34660, 1 31 390, 1 31 235, 603242, 603243, 1 301 30, 602345, 

10 126455, 601123, 126064, 125240, 602359, 603312, 602380, 603318, 123890, 123836, 603356, 603361 , 603366, 

123830, 179610, 188060, 123620, 120980, 186355, 118510, 114835, 114217, 113810, 603499, 182310, 111740, 
109610, 603548, 603564, 108740, 603598, 603613, 107273, 603626, 602518, 179410, 603647, 602515, 603652, 
106195, 602573, 178990, 105210, 104615, 167055, 603717, 104614, 603728, 

104210, 603749, 603750, 103850, 602608, 603787, 603788, 603796, 173445, 103220, 102910, 102681 , 102670, 
102642, 603833, 173391, 102576, 102575, 171833, 102573, 101800, 603875, 601108

3) Channel 

272 entries found, searching for "channel" 

176266, 600724, 170500, 182390, 123825, 114208, 114205, 601784, 114206, 600937, 114204, 603415, 600053, 
114209, 114207, 600760, 118425, 601011, 192500, 176261, 600761, 176260, 600359, 600228, 600877, 602235,

300008, 182389, 182391 , 601328, 601534, 600504, 602323, 601958, 602780, 602781, 601327, 601012, 600734, 
603208, 182392, 603220, 603219, 603888, 600054, 602232, 601745, 603537, 602604, 603796, 302910, 602866, 
601013, 602905, 602906, 600163, 152427, 180901 , 600702, 600308, 602754, 107776, 602024, 314555, 601949, 
600235, 602023, 176263, 600681, 176265, 193245, 603305, 176258, 602983, 601219, 601141, 176267, 602343,
602726, 138253, 176262, 600003, 600397, 602872, 138249, 600843, 600935, 600580, 600845, 602158, 602106,

176264, 300110, 176257, 602717, 603493, 176268, 600932, 602727, 138254, 

603652, 300138, 602420, 600570, 600150, 603583, 602345, 603749, 601142, 176256, 600846, 138252, 602982, 
603787, 602836, 603788, 602566, 603651, 602421, 100690, 107777, 100725, 100710, 600509, 603061, 154275,
304040, 154276, 180902, 121014, 602368, 139311 , 601383, 108745, 601313, 601042, 600131 , 186360, 600109, 

600229, 600170, 603319, 601485, 118503, 180903, 602076, 124030, 601059, 601212, 601218, 147450, 600855,

600919, 601154, 601157, 171060, 600968, 182139, 131230, 121015, 600421 , 113730, 249210, 310500, 600637, 
125950, 118800, 156490, 602974, 104610, 121011, 602522, 118504, 300041, 160900, 601382, 602103, 600465,
602014, 600442, 601109, 602481, 277900, 254210, 138247, 164920, 170280, 171050, 128100, 173910, 600884,
123885, 602887, 600232, 180297, 137192, 600304, 138251 , 603053, 300103, 

603152, 603199, 118511, 118508, 138079, 600983, 182307, 603324, 305990, 603418, 114080, 232200, 600046,

600040, 602403, 603750, 603785, 104210, 600019, 600300, 182860, 603852, 603853, 603855, 516060 

4) Extracellular matrix

167 entries found, searching for "extracellular matrix" 

603479, 602201, 601418, 601548, 154870, 115437, 602285, 602262, 602261, 134797, 600754, 120361, 116935,

602263, 603320, 601807, 603321, 185250, 185261, 253700, 128239, 120324, 193300, 276901, 308700, 600514,
600261, 602109, 120140, 120150, 147557, 193400, 600536, 188826, 120180, 118661, 120320, 152200, 135821,
112260, 230740, 602090, 155760, 192975, 190182, 602108, 601463, 186745, 600900, 600985, 600758, 602369,
179590, 601211, 600065, 602178, 600262, 182888, 182889, 151510, 182120, 150325, 190181, 150370, 186355,

193065, 165070, 154705, 147559, 146650, 146640, 153619, 175100, 187380, 231050, 188060, 135820, 156790,

130660, 301870, 128240, 600076, 600119, 193210, 600215, 600245, 121010, 150240, 600309, 600491, 222600,
120328, 600564, 600596, 600616, 600700, 600742, 120325, 138297, 600930, 

156225, 601028, 601050, 601105, 253800, 601284, 601313, 120280, 310200, 601492, 120250, 601587, 601636, 
601652, 601692, 601728, 120220, 601915, 602048, 155120, 310300, 120210, 120165, 120120, 118940, 116930,
602264, 116806, 602366, 120470, 602415, 602428, 602453, 602505, 602574, 603005, 603196, 603221, 603319,

107269, 216550, 103320, 603489, 603551, 603767, 603799, 603842 

5) Receptor (including membrane proteins, and also including transcription factors, since nuclear proteins were 
not excluded in the search) 
1606 entries found, searching for "receptor" 

600408, 176943, 107770, 601531, 143890, 313700, 162643, 202200, 147670, 191306, 182131, 192321, 138249,
190120, 600264, 162321, 603613, 603614, 602164, 138251, 182134, 138040, 152790, 602393, 133430, 601769,
304800, 147280, 168468, 147730, 155555, 191190, 109760, 122561, 136537, 602034, 147138, 187930, 312861,
107450, 138981, 182135, 603612, 191191, 600933, 601267, 600168, 120650, 182133,
102576, 600022, 603366, 600052, 136430, 601835, 600825, 600979, 102981, 192974,
601972, 191315, 602601, 603611, 602943, 601300, 191316, 108960, 601299, 147810,
602392, 602910, 603372, 106165, 601199, 180901, 131244, 107470, 136435, 146740,
109690, 100690, 138850, 118504, 147781, 146928, 601373, 126449, 
104225, 100725, 164342, 104220, 118503, 136538, 137241, 601663, 601805, 306250, 
600983, 126110, 118507, 602548, 162010, 601268, 147671, 601284, 113503, 109190,
600956, 138245, 139139, 155541, 137192, 118445, 104250, 300119, 118511, 601159,
114610, 138253, 146929, 602778, 135630, 147140, 146790, 147045, 601998, 158378, 
137142, 113995, 138247, 182137, 147811, 182132, 138248, 600414, 600563, 137160, 
138254, 601600, 173800, 146760, 179590, 180903, 109635, 300038, 138244, 182139,
602282, 179611, 143090, 100710, 601076, 600380, 170998, 600845, 118505, 118510,
137140, 600456, 147880, 137190, 601156, 601956, 100730, 601194, 602717, 
173880, 600042, 180902, 300093, 602423, 602469, 603028, 601437, 146933, 179610, 
137143, 603361, 603031, 137141, 136539, 162332, 118509, 155540, 600233, 601109,
131235, 603506, 603455, 151445, 602408, 601681, 603540, 602454, 602490, 600214,
600730, 601122, 120577, 160994, 603248, 602621, 601305, 176946, 118502, 176888,
601470, 603453, 603065, 137163, 137166, 600846, 118495, 138243, 192975, 138060,
120340, 600315, 136425, 118493, 600731, 601502, 601503, 601751, 603651, 602345,
118494, 186880, 134934, 601007, 600946, 186930, 186970, 133171, 308380, 120620,
186810, 180220, 173410, 188070, 131550, 109691, 147679, 180240, 173490,
300034, 264080, 114131, 146661, 147370, 147545, 136352, 190182, 186780, 126453,
126451, 128239, 600018, 601487, 601040, 305990, 601604, 190181, 188545, 162641,
601586, 164770, 600073, 602376, 138491, 118946, 176761, 176806, 601770, 180245,
109610, 192977, 159530, 602997, 164860, 300007, 146710, 601757, 150370, 118444,
108360, 162642, 173850, 603881, 180246, 193210, 173470, 601469, 134935, 182452,
601790, 600221, 109630, 602451, 147265, 602626, 602001, 602069, 601642, 600848,
176886, 602250, 182454, 600222, 600821, 180247, 602730, 134637, 167055, 165196, 
600527, 600167, 153618, 601330, 601970, 600785, 600004, 601043, 109135, 
109636, 601613, 600804, 601529, 102581, 600936, 126150, 182451, 300107, 190151,
600421, 100997, 602648, 603243, 300035, 305670, 182455, 162651, 109535, 601320,
601535, 102775, 601166, 180381, 176804, 602188, 600051, 601993, 601773, 603755,
600253, 300130, 137026, 602368, 154570, 176945, 602005, 120980, 602583, 102777,
601711, 104221, 176884, 600441, 108361, 600665, 600446, 600799, 600742, 104219,
601978, 600578, 602566, 600551, 601898, 151443, 602642, 300182, 191164, 602356,
182889, 142703, 601576, 100735, 176882, 602969, 182453, 601895, 602886, 602927,
601108, 176891, 176802, 600024, 600552, 601908, 176981, 603510, 151460,
602237, 602545, 162341, 603748, 161565, 600894, 186711, 137161, 601839, 601592,
104222, 601116, 602643, 601896, 601404, 601381, 600180, 600870, 123889, 600206, 
602950, 602174, 603159, 602042, 602297, 602836, 602355, 602304, 603822, 147267, 
603820, 603317, 601965, 601524, 601974, 603823, 602728, 601741, 137570, 603500,
186990, 601115, 600283, 182180, 600600, 600066, 600282, 176944, 600926, 600895,
600925, 182888, 603808, 603029, 602702, 602464, 603751, 425000, 601910, 151020,
601909, 603692, 136950, 600543, 164320, 300086, 602933, 602993, 137162, 603281,
600144, 600869, 600493, 602346, 600342, 602336, 113955, 602893, 601577, 
603749, 602892, 603810, 602853, 600579, 601044, 603809, 186690, 600752, 603304, 
602894, 602337, 603143, 109545, 602891, 602890, 602716, 603142, 602510, 603144,
603145, 107741, 107730, 164761, 208900, 107269, 114208, 235200, 158120, 147440,
151520, 131240, 139250, 601667, 153240, 218030, 147680, 186740, 600044, 107940,
601218, 142409, 601156, 601884, 601789, 125950, 171833, 602221, 139320, 147760,
165180, 166490, 168470, 601366, 176000, 191160, 102578, 103320, 112260, 192240, 
254210, 173610, 125480, 600957, 118945, 120470, 136351, 600725, 176797, 602887, 
252500, 104615, 601528, 600345, 601890, 151390, 602170, 186790, 186960, 
600542, 186973, 600423, 600058, 176730, 186945, 104760, 601309, 601500, 300091,
600726, 176885, 164875, 139330, 104640, 147840, 600065, 131242, 164160, 600565,
180860, 190040, 147720, 222100, 223900, 600555, 600994, 186770, 120420, 602004,
132890, 600007, 158105, 603886, 603884, 176830, 162200, 602458, 601603, 107910,
114855, 601498, 601511, 116831, 131390, 107773, 137800, 155750, 147795, 264600,



601919, 108962, 162323, 602779, 601828, 603499, 602909, 601834, 108961, 164920, 139191, 138971,
600377, 603826, 100710, 131243, 138032, 600849, 190160, 173510, 601743, 600798, 602167, 602757, 118496,
126452, 110700, 600445, 164870, 600239, 138492, 102776, 165195, 602746, 120920, 600241, 600524, 603819,
603071, 602697, 603507, 600242, 603030, 602592, 300188, 600553, 153243, 600240, 603141, 300200, 153390,
190000, 109091, 173511, 600337, 154540, 305660, 600041, 601496, 600644, 104260, 600843, 600409, 600232,
138246, 191311, 601598, 190010, 147851, 231200, 602691, 138033, 135620, 180190, 173391, 165070, 603411,
601050, 602848, 600867, 602190, 601070, 602885, 602992, 603821, 153622, 602657, 603195, 161555, 603232,
300090, 161561, 102582, 184757, 603233, 123910, 600509, 147569, 104210, 118508, 138252, 601296, 137164,
305915, 147139, 602729, 158050, 603652, 602343, 126450, 308385, 136350, 600939, 601167, 601523, 182098,
601426, 170650, 603691, 603659, 173393, 602351, 601522, 120830, 430000, 601380, 602043, 601204, 602756,
602646, 600011, 600868, 602550, 186830, 600978, 601512, 134370, 601459, 113705, 214500, 602731, 601211,
151530, 603599, 601099, 603776, 601212, 193400, 133100, 160900, 114207, 244400, 187040,
126455, 164040, 147683, 263200, 188230, 184756, 147574, 179615, 176960, 176947, 147561, 131320, 107265,
147470, 264700, 173335, 124010, 147050, 132810, 230350, 276000, 114130, 155730, 134638, 254150, 122560,
186940, 208150, 188040, 188400, 100600, 601564, 603875, 603849, 601497, 

603254, 603180, 602952, 602775, 602584, 602570, 601045, 600632, 602559, 601883, 600618, 601848, 601766, 
601753, 600554, 600937, 600694, 311360, 302910, 600173, 300139, 300065, 274000, 305100, 300116, 300022,

163800, 187410, 187020, 603818, 105850, 107941, 307800, 601897, 212895, 603598, 603584, 603583, 603454, 
603412, 601872, 114204, 266300, 114841, 116897, 152690, 603261, 603130, 603109, 603061, 603055, 603043,
601771, 154275, 600910, 187011, 188060, 600039, 600464, 186355, 121360, 122559, 123000, 253800, 602901,
125255, 131100, 600837, 602772, 600835, 131530, 602695, 137580, 602623, 139130, 249210, 142408, 602457,
600179, 142445, 154276, 245480, 142800, 600998, 602354, 601698, 602272, 601053, 602265, 231550, 229300,

147780, 601985, 147796, 226200, 600276, 147830, 150200, 601937, 189906, 

601922, 600341, 100650, 601662, 189965, 159440, 190020, 601054, 600764, 600410, 153440, 205400, 601055,
203800, 600576, 600749, 190090, 601767, 600475, 190080, 154550, 184745, 601721, 601712, 600594, 600739,
600738, 180069, 601020, 601679, 162662, 601266, 601676, 601203, 172400, 601463, 600685, 601599, 164009,

600611, 601570, 600626, 600201, 600514, 191100, 195002, 600567, 600633, 191030, 190700, 600014, 303700,

308210, 308230, 191170, 269700, 191092, 192130, 600319, 238600, 195000, 600125, 600577, 261680, 600566,
266100, 309900, 266600, 306900, 600536, 194070, 201910, 188860, 188595, 257220, 600038, 202110, 261000,
600363, 600850, 306100, 602229, 603883, 102645, 186921, 186910, 603785, 186852, 104311, 107266, 107270,
603746, 603732, 109270, 602060, 602041, 148180, 151385, 151625, 123830, 

182530, 123842, 124030, 134660, 153370, 134850, 601023, 182280, 134797, 182125, 134390, 601047, 135820,

601804, 153420, 601801, 139312, 139392, 181031, 601622, 180700, 147570, 603157, 162150, 601589, 121013,
151440, 602919, 151510, 123810, 123890, 131210, 179730, 602663, 179020, 602630, 601231, 601252, 137295,
138190, 138550, 277700, 601295, 139311, 153455, 176311, 176310, 601313, 133170, 141800, 142995, 602358,
143100, 145505, 300079, 168461, 145981, 158070, 106700, 107400, 107720, 601485, 103950, 108330, 165360,

300300, 156490, 603492, 602018, 118485, 164951, 164730, 164720, 147620, 603406, 164343, 119530, 161560,

142910, 602414, 601933, 120290, 164013, 164012, 600667, 601534, 601513, 601520, 164015, 164340, 173515, 
164975, 302020, 160782, 601501, 165230, 302060, 165120, 601492, 601483, 

167040, 167415, 600714, 192968, 600696, 601458, 601447, 173325, 168820, 176820, 601413, 190198, 600489,
173445, 601754, 600481, 300155, 600453, 154230, 173430, 154030, 173360, 181590, 176876, 176970, 178600,

164005, 182160, 274500, 179616, 275120, 151990, 300147, 601863, 211750, 275355, 151675, 190170, 179838,

600750, 180200, 180231, 151430, 600753, 601582, 150240, 601584, 601153, 224500, 601154, 601969, 180440, 
600596, 600281, 600262, 227400, 601997, 147684, 147678, 600761, 147581, 602014, 602024, 147573, 601150,
159555, 147559, 600945, 191163, 602065, 147558, 601641, 147450, 602096, 602113, 600583, 602154, 600556,
159350, 147245, 186855, 192020, 191510, 146631, 146630, 180950, 601693, 146255, 602260, 238300, 600231,

602293, 194540, 602299, 145680, 602311, 145260, 600230, 602350, 239100,

600508, 143055, 601723, 143010, 155960, 601377, 602382, 275350, 600220, 602396, 304050, 600424, 602421,
142461, 602430, 602431, 142460, 246530, 273300, 249100, 602461, 602465, 141850, 601788, 602498, 140210,
181460, 600422, 300127, 600164, 600765, 602582, 138800, 300101, 138720, 138420, 602609, 600157, 138079,
138030, 600156, 601017, 137240, 602671, 602680, 602682, 136530, 182090, 600807, 601826, 602724, 153245,

182331, 210250, 600151, 602738, 131340, 131241, 300008, 131195, 600119, 602809, 602839, 602840, 128240,

126850, 126065, 252650, 253200, 188380, 600810, 600396, 123841, 182810, 123835, 600053, 217030, 602921,
602931, 602932, 601900, 600050, 185430, 185520, 121800, 254770, 259700, 222300, 120940, 120700, 120436,
120360, 603033, 600295, 603035, 262600, 603046, 600009, 263400, 120150, 

263520, 603126, 590085, 185605, 118960, 118940, 118455, 603149, 603177, 300019, 603181, 118425, 603207,
426000, 116955, 116940, 116930, 300088, 603264, 603272, 603274, 116899, 603302, 313650, 266265, 603331,

114835, 603356, 114830, 312760, 603378, 602061, 310400, 603414, 603437, 113730, 310200, 113710, 147557, 
113000, 112205, 111400, 111100, 110750, 603518, 603526, 602136, 603576, 309845, 603594, 308310, 108728, 
108355, 600250, 307200, 603628, 107777, 602165, 603653, 600899, 107273, 107271, 603695, 603705, 603717,
186590, 186720, 186760, 186820, 187700, 600916, 147183, 105590, 104610, 146770, 306400, 602227, 103390,
103280, 103070, 102670, 301000, 603851, 600874, 600871, 102200, 158380, 603889

6) Glycoprotein 

438 entries found, searching for "glycoprotein" 

231200, 273800, 138700, 138600, 173510, 173515, 192974, 109770, 138610, 603578, 195002, 182889, 601275, 
602977, 300051, 173470, 171050, 138680, 142640, 194460, 176390, 138720, 600119, 185430, 138670, 180297,

171060, 600073, 173335, 128239, 118850, 602616, 110700, 164345, 159465, 159460, 191845, 173511, 600564, 
159440, 256550, 153310, 168810, 600738, 151460, 138470, 190197, 603586, 601325, 601193, 162820, 158120,
155970, 190920, 603201, 603594, 601411, 155735, 186730, 601028, 158030, 176391, 115501, 137290, 601103,
600315, 176870, 601525, 601418, 137061, 137207, 137060, 176397, 176393, 176394, 182888, 160995, 176398,
600718, 176396, 176392, 176399, 176401, 176395, 125645, 601774, 104175, 195000, 601785, 193400, 600065,
111200, 253700, 152200, 154550, 310200, 188230, 256540, 120980, 130660,

107269, 158070, 156790, 131550, 135630, 138297, 155760, 231050, 603381, 143890, 176801, 118860, 103600,
193210, 186820, 114890, 188060, 118910, 107271, 153340, 313470, 600074, 603062, 109270, 602724, 603356,

120324, 192340, 266265, 110900, 306400, 601327, 118457, 151250, 136836, 253000, 104640, 190010, 190000, 
135620, 135600, 601852, 134797, 134370, 111250, 156225, 600596, 187430, 164870, 151510, 186720, 601962, 
600900, 151530, 180990, 180290, 182120, 603759, 179605, 158381, 158373, 158343, 155740, 153420, 151675, 
142800, 176300, 153240, 140100, 134820, 134660, 134390, 125240, 120520, 112205, 111740, 106100, 104225, 
103000, 139090, 176860, 102720, 176895, 180621, 602421, 602257, 252600, 252650, 180620, 227600, 228960,

208900, 182145, 208400, 602643, 602593, 602461, 602171, 182900, 601269, 

600270, 600033, 602894, 247100, 186910, 600239, 230500, 217800, 600867, 253220, 601581, 305600, 162060,
158378, 188450, 186880, 186990, 188040, 154582, 164022, 188540, 153634, 189965, 193190, 165095, 165220,
158375, 152790, 158374, 151627, 182205, 155750, 191316, 151523, 151020, 165330, 166490, 192225, 224050,

224100, 185490, 192240, 173445, 167770, 185050, 153619, 230800, 147740, 147730, 147670, 153618, 236200,

193300, 249000, 176385, 153440, 147557, 152310, 253270, 184900, 182331, 146760, 146661, 272300, 272800,
142930, 169615, 171640, 182520, 182310, 300027, 147780, 305370, 151525, 142858, 142840, 308230, 308385,
309060, 142445, 142290, 216950, 139130, 600026, 600044, 600046, 138971, 138960, 138140, 600133, 600182, 
600215, 173350, 173360, 135820, 600403, 600410, 600491, 134934, 147563, 

600535, 133700, 600587, 133170, 131390, 600616, 600713, 131195, 600722, 130160, 600742, 600751, 600764,

600769, 600798, 180540, 600978, 129010, 601040, 601050, 125505, 125490, 124010, 601194, 601211, 601253,
147380, 120930, 601313, 120830, 120620, 601365, 601368, 120220, 601456, 601484, 117700, 147045, 601652, 
274500, 116930, 116880, 601788, 116805, 114250, 113810, 601880, 601914, 601932, 111700, 602046, 602053, 
602108, 602243, 176820, 178630, 178640, 111680, 300017, 109535, 142860, 602692, 109530, 602746, 308840, 

107740, 107400, 603104, 603125, 603126, 603130, 107273, 603197, 107266, 603241, 603243, 104614, 312080,

104610, 603492, 103390, 102670, 102480, 100735, 603665, 603756, 100730, 603777 

7) Protein kinase (a member of signal transduction) 
729 entries found, searching for "protein kinase" 

600289, 160900, 601032, 176948, 600899, 176947, 176894, 601939, 602980, 601314, 601955, 601335, 600447,

602130, 603606, 601158, 602006, 602549, 602678, 176872, 603014, 602520, 186945, 601254, 602315, 601263, 
177061, 176873, 600497, 603607, 602449, 602740, 602898, 602741, 300083, 602899, 114080, 603412, 601184,
602739, 176910, 600050, 176949, 602741, 602743, 600341, 601951, 601212, 600038, 156490, 600855, 600758,
603455, 156491, 603424, 300075, 180381, 151510, 600168, 600085, 601890, 602896, 602123, 600221, 602904,

602687, 176945, 602897, 600222, 602098, 601685, 176944, 109635, 601795, 601679, 603722, 602521, 601436,

191164, 602004, 176942, 601684, 300101, 601299, 601434, 602615, 601435, 176960, 153390, 603166, 179090,
300300, 176871, 602216, 176912, 176893, 603453, 300185, 177060, 176943, 

176981, 603441, 176970, 116899, 603442, 311550, 603495, 169190, 600448, 602399, 176946, 176977, 188830,
602350, 601591, 601782, 137026, 400008, 300094, 600058, 600831, 176892, 601639, 602919, 176911, 176975,

300172, 603072, 600583, 177015, 176982, 176980, 603248, 176941, 602757, 600870, 603440, 148750, 601959,

600869, 602448, 600408, 306000, 266200, 602539, 602425, 600160, 310200, 208900, 600997, 186973, 305360,
179611, 307030, 603262, 601050, 602758, 147795, 311800, 600664, 600441, 603251, 600527, 313650, 123831,
191315, 602958, 123832, 602989, 602614, 123828, 136350, 601284, 602940, 601329, 600173, 171834, 124095,
603464, 603584, 116953, 300035, 600600, 109135, 601232, 601988, 603850, 603384, 603184, 602990, 125855, 

601047, 603460, 603258, 602748, 603723, 164870, 603368, 139310, 601818,

602913, 603005, 602221 , 173570, 147670, 602610, 600524, 113508, 251170, 142370, 142408, 191306, 147796, 
118423, 600286, 601074, 600765, 600922, 179610, 602747, 601953, 601132, 600136, 601288, 601289, 602297, 
191316, 601535, 602925, 125450, 602839, 602168, 602461, 105590, 114085, 600714, 154235, 603261, 109270, 
191305, 602625, 602366, 602749, 600927, 300134, 603496, 136352, 176790, 600073, 601523, 115441, 191170,

114210, 116900, 188250, 102576, 603764, 603497, 603304, 601934, 602255, 600004, 600526, 601589, 600856,

601269, 601465, 601207, 601935, 600863, 602394, 601983, 602048, 300200, 179050, 164761, 189980, 602626,
137025, 602694, 602731, 125305, 123810, 300139, 601792, 275350, 171833, 193525, 601540, 162200, 601826,
116898, 601108, 601528, 600954, 602609, 602838, 603167, 601522, 600505, 

600431, 600267, 188555, 600140, 308240, 600137, 600011, 151410, 601530, 134934, 116952, 300142, 191311,
601603, 603889, 601728, 165160, 165070, 136351, 601836, 601839, 133090, 601595, 602745, 602516, 154950,

603601 , 602052, 154045, 603583, 600695, 602933, 602756, 601014, 602474, 602887, 172270, 601366, 180220, 
602189, 114105, 600963, 603369, 601296, 603289, 602265, 602337, 600066, 600456, 602387, 600917, 139270, 
603271 , 188345, 601231, 115440, 115442, 600778, 164920, 175100, 310300, 601767, 603168, 603140, 603113, 
116940, 162060, 603048, 603275, 601146, 600434, 602336, 602310, 602308, 147522, 603311, 227645, 114110,
601114, 300022, 164785, 167414, 123280, 600007, 600560, 229300, 151430, 176740, 602426, 109636, 602775, 
108355, 601999, 601524, 190151 , 202500, 601592, 600723, 102750, 600753, 

603618, 603814, 147521, 602514, 603015, 602525, 602524, 172471 , 602647, 602590, 600864, 172470, 602538, 
602527, 602526, 143890, 164730, 232300, 131550, 116951, 601713, 154550, 603434, 138160, 601702, 126335, 
272800, 601497, 602038, 107940, 191190, 273800, 193300, 120150, 603068, 603259, 102582, 601099, 253700,
308000, 602338, 602689, 185605, 164040, 600119, 152690, 601496, 602188, 123900, 602860, 164960, 600618,
601045, 300189, 164860, 602505, 230800, 141850, 147545, 110300, 600051, 138600, 233700, 109700, 238600,
109690, 253800, 311770, 156225, 603719, 191030, 309000, 601893, 190030, 189972, 601247, 190120, 600179,
258501, 190040, 300121, 180380, 314850, 600438, 306400, 179715, 190450, 600636, 180901, 173335, 601290,
154500, 131195, 115460, 603078, 603775, 603760, 158900, 602354, 102680, 

122560, 602382, 103320, 603597, 603510, 232600, 133170, 603353, 603345, 603318, 603309, 230400, 139250, 
603296, 603272, 114240, 147557, 232400, 139259, 602122, 230200, 164014, 248600, 601703, 194364, 150000,
602102, 601683, 158070, 602170, 602007, 601306, 248610, 601387, 161770, 132800, 164760, 601902, 601253, 
173470, 602930, 603691, 165360, 123835, 603544, 147265, 107777, 602377, 600605, 602869, 601121, 602388, 
301300, 590060, 601059, 168461, 142600, 314555, 142410, 170290, 602498, 190090, 311030, 142409, 190181,
600966, 308100, 600936, 171900, 190182, 172405, 102600, 603754, 600281, 172430, 138249, 300039, 603154,
602642, 300041, 602659, 192240, 603473, 136537, 603134, 182280, 603084, 125660, 189990, 133430, 603755,
603302, 600737, 300144, 305400, 188545, 600652, 189940, 600633, 603009, 

181031, 182160, 148500, 602384, 600650, 602190, 171150, 173490, 164765, 147780, 600910, 601272, 147880,
125255, 120090, 173390, 602668, 600483, 602995, 602996, 180860, 142445, 116949, 602681, 603027, 182125, 
600327, 600799, 164757, 230450, 602730, 600782, 164880, 232700, 116935, 602303, 603203, 164710, 600244, 
600206, 263200, 300034, 601487, 600980, 600956, 601985, 600884, 601956, 191041, 601568, 601586, 155960,
602288, 176761, 113900, 600581, 516050, 190070, 164940, 600544, 107941, 312861, 602373, 123836, 106165,
103850, 274270, 601922, 602355, 147420, 311010, 163800, 600536, 602018, 307200, 603619, 603652, 193001,
193245, 603729, 603752, 118888, 603796, 305600, 603852, 603853, 603886 

8) Calmodulin kinase (a member of signal transduction) 
35 entries found, searching for "calmodulin binding" 

300172, 114180, 302020, 139312, 602584, 602293, 600310, 603379, 114106, 602350, 114105, 114213, 138249, 
153430, 177061, 182520, 102680, 600489, 601478, 306000, 600490, 600922, 601302, 601568, 168890, 164795,
163730, 602698, 602699, 602987, 102681, 603384

9) Transcription factor 

717 entries found, searching for "transcription factor" 

305371, 189963, 164177, 600297, 600298, 163260, 189907, 173110, 600733, 189903, 189972, 600438, 600281,
189962, 157670, 602272, 600609, 189968, 189971, 189889, 107580, 189908, 601748, 601542, 601632, 600610,
189969, 600480, 601760, 176310, 603022, 600519, 601750, 123803, 600520, 600494, 164176, 602460, 603246, 
113725, 107773, 189904, 602228, 147141, 132890, 189964, 164175, 142410, 313650, 164011 , 164014, 164005, 
601714, 600492, 164012, 601878, 600673, 600635, 191523, 602191 , 600490, 602149, 189967, 140580, 164343, 
603256, 600727, 600729, 600728, 140581 , 600571 , 601511 , 102582, 189906, 600660, 124097, 314310, 600860, 
600662, 184756, 600777, 600013, 600172, 600489, 601425, 600786, 600502, 602406, 600743, 189965, 603257, 
602751, 602542, 603148, 603107, 603789, 602318, 156845, 123811 , 600425, 

600540, 600695, 601622, 147574, 601601, 602407, 602150, 601538, 600663, 603306, 165170, 187040, 602444,
189902, 189973, 600659, 600661, 601010, 600788, 602617, 601602, 602053, 601742, 300039, 602438, 602976, 
600744, 602543, 602479, 600481 , 600473, 603739, 600426, 603738, 189901 , 603677, 600427, 603255, 600607, 
600379, 189909, 601679, 600787, 602160, 601043, 601397, 601044, 600366, 300025, 602575, 602669, 601804, 
601801, 176312, 176311, 600746, 602480, 602944, 600967, 600912, 306700, 306900, 193400, 601206, 480000,
191160, 601861, 164008, 600475, 600773, 600772, 600774, 142409, 156490, 600589, 601490, 151385, 600599,
184757, 602955, 234000, 603433, 603349, 603198, 602294, 600390, 603628, 147620, 600211 , 601787, 601863, 
147470, 603795, 603734, 152760, 104155, 128990, 601729, 600197, 147370, 

173490, 603423, 600822, 188595, 603243, 600573, 601689, 142765, 603181, 600879, 603731, 600288, 602295,
121360, 164874, 300019, 162095, 602355, 603258, 126090, 159540, 300070, 600555, 600664, 601874, 153245, 
191191, 601126, 601512, 146733, 131550, 142385, 601796, 603406, 602959, 601734, 601732, 139191, 139139, 
600633, 138971 , 600006, 603170, 601488, 147576, 147680, 601498, 602630, 602643, 603364, 600914, 154040, 
602746, 128992, 143089, 160900, 600140, 134934, 133510, 176860, 190180, 601150, 601175, 170993, 601361,
122560, 602778, 308230, 602903, 309550, 601788, 602946, 159970, 124092, 180200, 173410, 602356, 603015, 
600779, 603111, 187930, 602614, 600951 , 603200, 602369, 164770, 147569, 603300, 603301 , 159980, 134638, 
603431, 147730, 603366, 603348, 600556, 602136, 164160, 310200, 152390,

601241, 116897, 137295, 600576, 194070, 601487, 600698, 164810, 601769, 141900, 602225, 275350, 131100, 
179755, 600075, 162200, 165160, 116806, 600899, 123810, 133450, 216400, 278700, 190080, 164730, 191170,
193300, 600618, 600999, 601090, 106150, 601843, 133530, 110700, 602550, 138040, 133430, 300133, 163731,

602302, 126337, 309548, 180245, 126110, 602291, 109565, 107400, 314670, 601444, 143100, 104760, 106180,

601953, 600584, 125852, 602419, 600401 , 142200, 107680, 167414, 600020, 188400, 208900, 175100, 602700, 
601828, 139320, 602777, 600185, 602681 , 603023, 314997, 602848, 600284, 102578, 114290, 165095, 137070, 
602991, 602421, 600005, 602996, 314995, 152200, 151900, 112260, 129010, 600892, 273800, 176760, 602341,
490000, 136533, 400003, 601007, 602229, 603620, 602218, 602116, 602020, 

142000, 601955, 126340, 120150, 193067, 182452, 142461, 194558, 180660, 600756, 160745, 107741, 106210,

157640, 186770, 146738, 603759, 213700, 147880, 152391, 277900, 151626, 107730, 600711, 600246, 107470,
600237, 223100, 107748, 600065, 600349, 426000, 194500, 154030, 227650, 600247, 314980, 109560, 305370,
600800, 301000, 277700, 600838, 312173, 600439, 600440, 191315, 601595, 190450, 190070, 190020, 162360,
131320, 133540, 600993, 601993, 159530, 601902, 602868, 181590, 601724, 602260, 601093, 187270, 164761,

602102, 603245, 136950, 106100, 601182, 167410, 601897, 602896, 170100, 602506, 104150, 176730, 601600,

187011, 102600, 180380, 162080, 603450, 142967, 602301, 126375, 603372, 603355, 164720, 603250, 167409, 
167415, 602897, 601565, 185250, 182138, 601851 , 600749, 601575, 194548, 

154500, 601365, 194541 , 601621 , 601623, 601531 , 600790, 194355, 123830, 123812, 154540, 601415, 143055, 
601386, 194550, 186930, 131290, 601320, 601620, 601754, 601313, 184430, 182900, 182500, 600725, 147870, 

154365, 116953, 601297, 601296, 601265, 600796, 120436, 601644, 601930, 601643, 230200, 601645, 601972,

600861, 602009, 601172, 601158, 601646, 180630, 600821, 118440, 601656, 601647, 150200, 601125, 601671, 
141850, 116899, 600697, 109270, 202110, 150570, 601108, 191339, 601063, 109691, 180240, 203100, 151430,
179710, 111000, 176797, 238600, 104311, 240300, 125255, 600423, 158070, 602439, 600324, 112261, 243305,
602474, 174762, 600613, 602539, 138890, 138720, 114550, 173865, 602582, 602584, 173510, 600250, 602627,

173325, 602635, 246530, 172425, 600193, 602691, 600188, 170998, 152790,

168468, 256540, 225250, 600848, 143400, 168461, 262600, 168360, 601912, 602951, 600017, 230000, 266600,
602981, 272800, 109150, 102200, 603025, 603026, 603109, 167050, 603127, 603128, 165240, 230400, 313700,
164975, 164875, 602017, 115500, 235800, 164873, 602110, 164785, 164772, 312865, 603296, 600542, 164740,
602125, 309801, 602148, 300007, 306955, 603368, 116940, 602181, 603416, 126650, 163920, 300024, 603437,

602209, 603576, 603607, 305435, 600944, 180410, 303630, 159557, 301870, 132810, 100790, 603849, 603862,

603881, 

[0074] There are several methods for analyzing the expression levels of genes associated with diseases. Differences
in gene expression levels between diseased and normal tissues can be studied by analytical methods such as
Northern hybridization and differential display. Other examples include methods using high-density cDNA filters,
DNA microarrays, and PCR amplification (Experimental Medicine, Vol. 17, No. 8, 980-1056 (1999);
Cell Engineering (additional volume) DNA Microarray and Advanced PCR Methods, Muramatsu & Naba (eds.),
Shujunsya). When a clear difference in expression level is observed for a gene, it can be concluded that the
gene is closely associated with a disease or disorder. Cultured cells can be used for the assessment instead of
diseased tissues. Similarly, when gene expression clearly differs between normal cells and cells reproducing
disease-associated specific features, it can be concluded that the gene is closely associated with a disease or
disorder. When the expression levels of genes vary markedly during major cellular events (such as differentiation
and apoptosis), the genes are involved in those cellular events and are accordingly candidates for disease- and/or
disorder-associated genes. Further, genes exhibiting tissue-specific expression play important parts in the functions
of those tissues and can therefore be candidates for genes associated with diseases and/or disorders affecting them.
[0075] For example, the non-enzymatic protein glycation reaction is believed to be a cause of a variety of chronic diabetic
complications. Accordingly, genes whose expression levels in endothelial cells are elevated or decreased in a
glycated protein-dependent manner are associated with diabetic complications caused by glycated proteins (Diabetes
1996, 45 (Suppl. 3), S67-S72; Diabetes 1997, 46 (Suppl. 2), S19-S25). The onset of rheumatoid arthritis is thought to
involve the proliferation of synovial cells covering the inner surfaces of the joint cavity and the inflammatory reaction
resulting from the action of cytokines produced by leukocytes infiltrating the joint synovial tissues (Rheumatism
Information Center, http://www.rheuma-net.or.jp/). Recent studies have also revealed that tumor necrosis factor (TNF)-α
participates in the onset (Current Opinion in Immunology 1999, 11, 657-662). When the expression of a gene exhibits
responsiveness to the action of TNF on synovial cells, the gene is considered to be involved in rheumatoid arthritis.
Many genes acting downstream of TNF-α and IL-1β among inflammation-associated cytokines have been previously
identified. The respective stimulations are transduced through independent signaling cascades. There
exists another signaling cascade for both stimulations, wherein NF-κB is a common transducing molecule shared by






the two stimulations (J. Leukoc. Biol., 1994, 56(5): 542-547). It has also been revealed that the expression levels of
many inflammation-associated genes, including IL-2, IL-6 and G-CSF, vary in response to the signal through the
common pathway (Trends Genet. 1999, 15(6): 229-235). It is assumed that genes whose expression levels vary
in response to stimulation with TNF-α or IL-1β also participate in inflammation. Genes associated with neural
differentiation can be candidates for causative genes of neurological diseases as well as candidates for genes usable for
treating the diseases.

[0076] Clones exhibiting differences in expression levels can be selected by gene expression analysis.
The selection comprises, for example, analyzing cDNA clones using a high-density cDNA filter, and statistically
treating the multiple signal values (radioisotope signal values of the radiolabeled probes, or fluorescence intensities
measured from the fluorescent labels) for the respective clones by a two-sample t-test, where
the signal values are determined in multiple hybridization experiments. The clones of interest are selected based
on statistically significant differences in the signal distribution at p<0.05. However, the set of clones selected as having
significantly different expression levels may change if the statistical treatment is partially modified.
For example, the clones may be selected by a two-sample t-test at p<0.01; or genes
exhibiting more explicit differences in expression levels can be selected by applying a pre-determined cut-off value
for the significant signal difference. Alternatively, the expression
levels are simply compared with each other, and the clones of interest are selected based on the ratio of their
expression levels.
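
The statistical selection described above can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the procedure actually used: the text says only "two-sample t-test", so the pooled-variance (Student) form is assumed here, and the function names, clone names and signal values are hypothetical.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=2000):
    """Two-sided p-value, integrating the density over [0, |t|] by Simpson's rule."""
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return max(0.0, 1.0 - 2.0 * area)


def student_t(a, b):
    """Pooled-variance two-sample t statistic (assumed variant) and its df."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    if pooled == 0:
        raise ValueError("signal values have zero variance")
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, df


def select_clones(signals, alpha=0.05):
    """Return clones whose two conditions differ significantly at the given alpha."""
    selected = []
    for clone, (cond_a, cond_b) in signals.items():
        t, df = student_t(cond_a, cond_b)
        if two_sided_p(t, df) < alpha:
            selected.append(clone)
    return selected


# Hypothetical signal values from three hybridization experiments per condition.
signals = {
    "clone_A": ([100, 110, 105], [200, 210, 190]),
    "clone_B": ([100, 120, 110], [105, 115, 112]),
}
print(select_clones(signals))              # selection at p<0.05
print(select_clones(signals, alpha=0.01))  # stricter selection at p<0.01
```

Passing a smaller `alpha` corresponds to the stricter p<0.01 treatment mentioned in the text.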

[0077] Clones exhibiting differences in expression levels can also be selected by comparing the expression levels
by PCR analysis, for example, by determining the band intensities representing the amounts of
PCR products with ethidium bromide staining; by determining the radioisotopic signal values or fluorescence
intensities of the PCR products when radiolabeled or fluorescence-labeled primers are used; or by determining the
radioisotopic signal values or fluorescence intensities of the probes hybridized to the PCR products when
radiolabeled or fluorescence-labeled probes, respectively, are used in the hybridization. If the expression level ratios
obtained in multiple PCR experiments are consistently at least 2-fold, the clone can be judged to exhibit a difference
in expression level. When the ratios are several-fold or not less than 10-fold, the clone can be selected as a gene
exhibiting an explicit difference in expression level.
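
The ratio-based judgment above can be sketched as a small classifier. This is a sketch under assumptions rather than the method itself: the thresholds of 2-fold and 10-fold come from the text, but the function name, the labels, and the choice to treat a consistent decrease (ratio of 1/2 or less) symmetrically to an increase are assumptions made here for illustration.

```python
def classify_clone(ratios, min_fold=2.0, explicit_fold=10.0):
    """Classify a clone from expression-level ratios of repeated PCR experiments.

    Returns "explicit" if every ratio is at least explicit_fold (or at most
    1/explicit_fold), "different" if every ratio is at least min_fold (or at
    most 1/min_fold), and "none" otherwise.
    """
    if not ratios or any(r <= 0 for r in ratios):
        raise ValueError("ratios must be a non-empty list of positive numbers")

    def consistently(fold):
        # The change must point the same way in every experiment.
        up = all(r >= fold for r in ratios)
        down = all(r <= 1.0 / fold for r in ratios)
        return up or down

    if consistently(explicit_fold):
        return "explicit"
    if consistently(min_fold):
        return "different"
    return "none"


# Hypothetical ratios (treated / untreated) from three PCR experiments.
print(classify_clone([2.5, 3.1, 2.2]))
print(classify_clone([12.0, 15.0, 11.0]))
print(classify_clone([2.5, 1.4, 3.0]))
```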

[0078] A survey of genes whose expression levels in endothelial cells vary specifically in response to glycated protein
revealed three genes with elevated expression levels: NT2RP2001538, NT2RP4001001 and Y79AA1000967.
These clones are genes associated with diabetes.

[0079] A survey of genes of which expression levels are varied in response to TNF-α (Tumor Necrosis Factor-alpha)
in the primary cell culture of synovial tissue detected the following clones with elevated expression levels in the presence
of TNF-α:

BNGH41000020, HEMBA1000349, HEMBA1000634, HEMBA1000671, HEMBA1000835, HEMBA1000962,
HEMBA1002178, HEMBA1002195, HEMBA1002239, HEMBA1002420, HEMBA1002524, HEMBA1002992,
HEMBA1003315, HEMBA1003392, HEMBA1003487, HEMBA1003602, HEMBA1004067, HEMBA1004797,
HEMBA1005337, HEMBA1005489, HEMBA1006916, HEMBB1000668, HEMBB1000905, HEMBB1001547,
HEMBB1001573, HEMBB1002041, HEMBB1002663, MAMMA1000652, MAMMA1000810, MAMMA1001634,
MAMMA1002091, MAMMA1002234, NT2RM2000306, NT2RM4000417, NT2RP1000002, NT2RP1000181,
NT2RP1000740, NT2RP2000694, NT2RP2001921, NT2RP2002527, NT2RP2004495, NT2RP2004606,
NT2RP2005163, NT2RP2005463, NT2RP2006134, NT2RP3000171, NT2RP3000652, NT2RP3001195,
NT2RP3001976, NT2RP3003473, NT2RP3003874, NT2RP3004090, NT2RP3004294, NT2RP3004557,
NT2RP3004647, NT2RP4000108, NT2RP4001001, NT2RP4001877, OVARC1000090, OVARC1000105,
OVARC1000275, OVARC1000439, OVARC1001607, PLACE1000740, PLACE1000927, PLACE1001016,
PLACE1001100, PLACE1001464, PLACE1001500, PLACE1001918, PLACE1002095, PLACE1002547,
PLACE1003644, PLACE1004519, PLACE1005031, PLACE1005410, PLACE1005736, PLACE1006219,
PLACE1006809, PLACE1008716, PLACE1010081, THYRO1001770, Y79AA1000127, Y79AA1000207,
Y79AA1000270, Y79AA1000876, Y79AA1001013, Y79AA1001264, Y79AA1001272, Y79AA1001328,
Y79AA1001430, Y79AA1001530, Y79AA1001799

[0080] Clones with decreased expression levels in the presence of TNF-α are NT2RM4000326, NT2RP1000300,
NT2RP2000514, NT2RP2001755, NT2RP2006042, NT2RP3000481 and NT2RP3002790. These clones are candidates
for rheumatoid arthritis-associated genes.

[0081] A survey of genes of which expression levels are varied in response to TNF-α (Tumor Necrosis Factor-alpha)
or IL-1β (Interleukin-1 beta) in a human T cell strain, Jurkat cell, revealed the following clones with elevated expression
levels in the presence of TNF-α:

MAMMA1000141, MAMMA1000788, MAMMA1001237, MAMMA1002070, NT2RM2000582, NT2RM2002109,
NT2RP1000679, NT2RP2003664, NT2RP2004108, NT2RP2005597, NT2RP3001592, NT2RP3002738,
NT2RP3004133, NT2RP3004294, NT2RP3004321, NT2RP3004557, PLACE1002547, PLACE1003573,
PLACE1004305, PLACE1008744, PLACE1010011, PLACE1010713, PLACE1011181, Y79AA1000776,
Y79AA1002129

[0082] The survey also revealed a clone of which expression level was decreased in the presence of TNF-α. The clone
is PLACE1002070. The same survey further revealed the clones of which expression levels were elevated in the
presence of IL-1β. The clones are MAMMA1000614, MAMMA1001237, NT2RM2000514 and NT2RP3001159. These
clones are genes associated with inflammation.

[0083] A survey of genes of which expression levels are varied in response to the stimulation for inducing cell
differentiation (stimulation using retinoic acid (RA) or using RA/inhibitor (inhibitor for cell division)) in cultured cells of
neural strain, NT2, revealed the following clones with elevated expression levels in the presence of RA:

HEMBA1000121, HEMBA1000275, HEMBA1000300, HEMBA1000634, HEMBA1000671, HEMBA1000875,
HEMBA1001184, HEMBA1001390, HEMBA1001886, HEMBA1002163, HEMBA1002227, HEMBA1002420,
HEMBA1002421, HEMBA1003072, HEMBA1003120, HEMBA1003294, HEMBA1003497, HEMBA1004007,
HEMBA1004110, HEMBA1004391, HEMBA1004444, HEMBA1005230, HEMBA1005246, HEMBA1005267,
HEMBA1005489, HEMBA1005913, HEMBA1006299, HEMBA1006357, HEMBA1006517, HEMBA1006544,
HEMBA1006658, HEMBA1006749, HEMBA1007063, HEMBA1007241, HEMBB1000447, HEMBB1000542,
HEMBB1000567, HEMBB1000642, HEMBB1000668, HEMBB1001026, HEMBB1001847, HEMBB1002051,
HEMBB1002120, HEMBB1002228, HEMBB1002693, MAMMA1000106, MAMMA1000141, MAMMA1000473,
MAMMA1000528, MAMMA1000810, MAMMA1000881, MAMMA1001634, MAMMA1001957, MAMMA1002205,
MAMMA1002224, NT2RM2000423, NT2RM2000497, NT2RM2000582, NT2RM2001126, NT2RM2001902,
NT2RM4000198, NT2RM4000284, NT2RM4000593, NT2RM4001321, NT2RP1000002, NT2RP1000050,
NT2RP1000181, NT2RP1000261, NT2RP1000465, NT2RP1000468, NT2RP1000579, NT2RP1000679,
NT2RP2000092, NT2RP2000479, NT2RP2000610, NT2RP2000663, NT2RP2000694, NT2RP2000903,
NT2RP2001388, NT2RP2001538, NT2RP2001878, NT2RP2002015, NT2RP2002304, NT2RP2002721,
NT2RP2002824, NT2RP2002942, NT2RP2002974, NT2RP2002976, NT2RP2003179, NT2RP2003302,
NT2RP2003383, NT2RP2003469, NT2RP2003664, NT2RP2003940, NT2RP2004069, NT2RP2004108,
NT2RP2004524, NT2RP2004556, NT2RP2004670, NT2RP2005069,
NT2RP2005247, NT2RP2005425, NT2RP2005463, NT2RP20055H, NT2RP2005535, NT2RP2005541,
NT2RP2005774, NT2RP2005878, NT2RP2005883, NT2RP2005887, NT2RP2006099, NT2RP2006134,
NT2RP3000011, NT2RP3000125, NT2RP3000171, NT2RP3000232, NT2RP3000460, NT2RP3000481,
NT2RP3000652, NT2RP3000677, NT2RP3000818, NT2RP3000820, NT2RP3001044, NT2RP3001061,
NT2RP3001170, NT2RP3001240, NT2RP3001322, NT2RP3001388, NT2RP3001542, NT2RP3001592,
NT2RP3001976, NT2RP3002790, NT2RP3002900, NT2RP3002983, NT2RP3003000, NT2RP3003354,
NT2RP3003532, NT2RP3003729, NT2RP3003874, NT2RP3003939, NT2RP3004025, NT2RP3004083,
NT2RP3004090, NT2RP3004130, NT2RP3004202, NT2RP3004294, NT2RP3004640, NT2RP4000108,
NT2RP4000634, NT2RP4002451, NT2RP4002715, OVARC1000090, OVARC1000208, OVARC1000275,
OVARC1000553, OVARC1000775, OVARC1000853, OVARC1000873, OVARC1000916, OVARC1000995,
OVARC1001030, OVARC1001049, OVARC1001132, OVARC1001596, OVARC1002178, PLACE1000258,
PLACE1000442, PLACE1000927, PLACE1000986, PLACE1001100, PLACE1001123, PLACE1001795,
PLACE1002518, PLACE1002547, PLACE1002967, PLACE1003407, PLACE1003428, PLACE1003644,
PLACE1003839, PLACE1004078, PLACE1004441, PLACE1004450, PLACE1005669, PLACE1005682,
PLACE1005736, PLACE1005768, PLACE1005815, PLACE1006073, PLACE1006208, PLACE1007296,
PLACE1007626, PLACE1008282, PLACE1008984, PLACE1008985, PLACE1010445, PLACE1011708,
PLACE1011978, PLACE4000455, SKNMC1000004, THYRO1000036,
THYRO1000580, THYRO1000776, THYRO1000999, THYRO1001063, THYRO1001128, THYRO1001205,
THYRO1001327, THYRO1001523, THYRO1001725, THYRO1001770, Y79AA1000207, Y79AA1000226,
Y79AA1000270, Y79AA1001056, Y79AA1001062, Y79AA1001090, Y79AA1001727, Y79AA1002213,
Y79AA1002381

[0084] The survey also revealed the clones of which expression levels were decreased in the presence of RA. The
clones are BNGH41000020, HEMBA1005070, NT2RP2005027, NT2RP3003473 and Y79AA1002376.

[0085] The same survey further revealed the following clones with elevated expression levels in the presence of
RA/inhibitor:

HEMBA1000128, HEMBA1000875, HEMBA1001390, HEMBA1002163, HEMBA1002227, HEMBA1002421,
HEMBA1004391, HEMBA1004454, HEMBA1004785, HEMBA1005913, HEMBA1006171, HEMBA1006299,
HEMBA1006335, HEMBA1006544, HEMBA1007241, HEMBB1000447, HEMBB1000668, MAMMA1000994,
MAMMA1001344, NT2RM2000582, NT2RP1001004, NT2RP2000663, NT2RP2000694, NT2RP2000903,
NT2RP2001388, NT2RP2002674, NT2RP2002974, NT2RP2003383, NT2RP2004069, NT2RP2004606,
NT2RP2004837, NT2RP2005069, NT2RP2005425, NT2RP2005463, NT2RP2005541, NT2RP2005883,
NT2RP2005887, NT2RP3000460, NT2RP3000838, NT2RP3001044, NT2RP3001240, NT2RP3001388,
NT2RP3002721, NT2RP3002738, NT2RP3003469, NT2RP3004083, NT2RP3004130, NT2RP3004202,
NT2RP3004294, NT2RP3004640, NT2RP4000108, NT2RP4002451, NT2RP4002715, OVARC1000275,
OVARC1000467, OVARC1000553, OVARC1000853, OVARC1000873, OVARC1000916, OVARC1000995,
OVARC1001030, OVARC1001222, OVARC1001596, OVARC1002058, OVARC1002178, PLACE1000927,
PLACE1001123, PLACE1001407, PLACE1001464, PLACE1001564, PLACE1001795, PLACE1002547,
PLACE1003407, PLACE1003644, PLACE1003845, PLACE1004441, PLACE1004482, PLACE1005410,
PLACE1005601, PLACE1005725, PLACE1005736, PLACE1006093, PLACE1006219, PLACE1006290,
PLACE1006716, PLACE1007296, PLACE1007626, PLACE1008359, PLACE1010968, PLACE1011364,
PLACE1011824, THYRO1000678, THYRO1000776, THYRO1000999, THYRO1001113, THYRO1001237,
THYRO1001523, Y79AA1000226, Y79AA1000888, Y79AA1001430

[0086] The same survey further revealed the following clones with elevated expression levels in the presence of
RA/inhibitor:

HEMBA1000349, HEMBA1001297, HEMBA1001878, HEMBA1005070, HEMBA1006482, HEMBB1001959,
NT2RM2001939, NT2RP1000981, NT2RP2001469, NT2RP3003473, OVARC1001132, PLACE1001655,
Y79AA1000127, Y79AA1002381. These clones are associated with neural differentiation and, therefore, are
candidates for genes associated with neurological diseases.

[0087] Based on functional analyses using a secretory protein, a membrane protein, or a protein associated with
signal transduction, glycoprotein, transcription, or diseases, it is possible to develop a medicine.
[0088] A membrane protein is most likely to be a protein that functions as a receptor or ligand on the
cell surface. Therefore, it is possible to reveal a new relationship between a ligand and a receptor by screening the
membrane protein of the invention based on its binding activity with a known ligand or receptor. Screening can be
performed according to known methods.

[0089] For example, a ligand for the protein of the invention can be screened in the following manner. Namely,
a ligand that binds to a specific protein can be screened by a method comprising the steps of: (a) contacting a test
sample with the protein of the invention or a partial peptide thereof, or cells expressing these, and (b) selecting a test
sample that binds to said protein, said partial peptide, or said cells.

[0090] On the other hand, for example, screening using cells expressing the protein of the present invention that is
a receptor protein can also be performed as follows. It is possible to screen for receptors that are capable of binding to a
specific protein by (a) contacting sample cells with the protein of the invention or its partial peptide,
and (b) selecting cells that can bind to said protein or its partial peptide.

[0091] In one example of such screening, the protein of the invention is first expressed, and the recombinant
protein is purified. Next, the purified protein is labeled, a binding assay is performed using various cell lines or primary
cultured cells, and cells expressing a receptor are selected (Growth and differentiation factors and their
receptors, Shin-Seikagaku Jikken Kouza Vol. 7 (1991), Honjyo, Arai, Taniguchi, and Muramatsu (eds.), p203-236,
Tokyo-Kagaku-Doujin). A protein of the invention can be labeled with a radioisotope (RI) such as 125I, or with an enzyme
(alkaline phosphatase, etc.). Alternatively, a protein of the invention may be used without labeling and then detected by
using a labeled antibody against the protein. The cells selected by the above screening methods, which express a
receptor of the protein of the invention, can be used for further screening of agonists or antagonists of said receptor.
[0092] Once the ligand binding to the protein of the invention, the receptor of the protein of the invention, or the cells
expressing the receptor are obtained by screening, it is possible to screen for a compound that binds to the ligand and
receptor. It is also possible to screen for a compound that can inhibit these bindings (agonists or antagonists of the
receptor, for example) by utilizing the binding activities.

[0093] When the protein of the invention is a receptor, the screening method comprises the steps of (a) contacting
the protein of the invention or cells expressing the protein of the invention with the ligand, in the presence of a test
sample, (b) detecting the binding activity between said protein or cells expressing said protein and the ligand, and (c)
selecting a compound that reduces said binding activity when compared to the activity in the absence of the test sample.
Furthermore, when the protein of the invention is a ligand, the screening method comprises the steps of (a) contacting
the protein of the invention with its receptor or cells expressing the receptor in the presence of a test sample, (b) detecting
the binding activity between the protein and its receptor or the cells expressing the receptor, and (c) selecting a
compound that reduces the binding activity compared to the activity in the absence of the test sample.
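
Step (c) of the screening above, selecting test samples that reduce binding relative to the sample-free control, can be sketched as follows. This is a hedged illustration only: the text does not state how large a reduction is required, so the `min_reduction` threshold, the function name and the sample names and values are all assumptions.

```python
def select_inhibitors(binding_with_sample, baseline, min_reduction=0.5):
    """Return test samples that reduce binding activity by at least min_reduction.

    binding_with_sample maps a sample name to the binding activity measured in
    its presence; baseline is the activity measured without any test sample.
    min_reduction is a fraction of the baseline (assumed threshold).
    """
    if baseline <= 0:
        raise ValueError("baseline binding activity must be positive")
    hits = []
    for name, activity in binding_with_sample.items():
        reduction = 1.0 - activity / baseline
        if reduction >= min_reduction:
            hits.append(name)
    return hits


# Hypothetical binding signals (arbitrary units) for three test samples.
measured = {"cmpd_1": 200.0, "cmpd_2": 950.0, "cmpd_3": 480.0}
print(select_inhibitors(measured, baseline=1000.0))
```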

[0094] Samples to be screened include, for example, cell extracts, expression products of a gene library, synthetic
low molecular weight compounds, synthetic peptides, and natural compounds, but are not limited to these. A
compound isolated by the above screening using the binding activity of the protein of the invention can also be used
as a sample.

[0095] A compound isolated by the screening may be a candidate agonist or antagonist of the receptor
of the protein. By utilizing an assay that monitors a change in intracellular signaling, such as phosphorylation, that
results from reduced binding between the protein and its receptor, it is possible to determine whether the obtained
compound is an agonist or an antagonist of the receptor. The compound may also be a candidate molecule that can
inhibit the interaction between the protein and its associated proteins (including a receptor) in vivo. Such compounds
can be used for developing drugs for the prevention or treatment of a disease with which the protein is associated.
[0096] Secretory proteins may regulate cellular conditions such as growth and differentiation. It is possible to find
a novel factor that regulates cellular conditions by adding the secretory protein of the invention to a certain kind of
cell and performing a screening that utilizes cellular changes in growth or differentiation, or the activation of a particular
gene.

[0097] The screening can be performed, for example, as follows. First, the protein of the invention is expressed and
purified in a recombinant form. Then, the purified protein is microinjected into or added to various kinds of cell lines or
primary cultured cells, and changes in cell growth and differentiation are monitored. The induction of a particular
gene that is known to be involved in a certain cellular change is detected from the amounts of its mRNA and protein.
Alternatively, the amount of an intracellular molecule (a low molecular weight compound, etc.) that is changed by the
function of a gene product (protein) known to function in a certain cellular change is used for the detection.
[0098] Once the screening reveals that the protein of the invention can regulate cellular conditions or functions,
the protein can be applied as a pharmaceutical or diagnostic medicine for associated diseases, either by itself or after
altering a part of it into an appropriate composition.

[0099] As described above for membrane proteins, the secretory protein provided by the invention may be used
to explore a novel ligand-receptor interaction by screening based on the binding activity to a known ligand or
receptor. A similar method can be used to identify an agonist or antagonist. The compounds obtained by these
methods can be candidates for compounds that inhibit the interaction between the protein of the invention and
an interacting molecule (including a receptor). The compounds may be usable as preventive, therapeutic, or
diagnostic medicines for diseases in which the protein may play a certain role.

[0100] Proteins associated with signal transduction or transcription may be factors that affect a certain protein or
gene in response to intracellular or extracellular stimuli. It is possible to find a novel factor that can affect a protein or
gene by expressing the protein provided by the invention in certain types of cells and performing a screening that utilizes
the activation of a certain intracellular protein or gene.

[0101] The screening may be performed as follows. First, a transformed cell expressing the protein is obtained. Then,
the transformed cell line and the untransformed original cell are compared for changes in the expression of a certain
gene by detecting the amount of its mRNA or protein. Alternatively, the amount of an intracellular molecule (a low
molecular weight compound) that is changed by the function of a gene product (protein) known to function in a certain
cellular change may be used for the detection. Furthermore, a change in the expression of a certain gene can be
detected by introducing into a cell a fusion gene that comprises a regulatory region of the gene and a marker gene
(luciferase, beta-galactosidase, etc.), expressing the protein provided by the invention in the cell, and measuring the
activity of the marker gene product (protein).

[0102] If the protein or gene of the invention is associated with diseases, it is possible to screen for a gene or compound
that can regulate its expression and/or activity, either directly or indirectly, by utilizing the protein of the present invention.
[0103] For example, the protein of the invention is expressed and the recombinant protein is purified. Then, the
protein or gene whose expression was affected in the presence of the protein of the invention is also purified, and
the binding activity between the two proteins or genes is examined. The examination may be performed after
pretreatment with a compound that is a candidate inhibitor. In an alternative method, a transcription regulatory region
located 5'-upstream of the gene encoding the protein of the invention that is capable of regulating the expression
of other genes is obtained and fused with a marker gene. The fusion is introduced into a cell, and compounds are added
to the cell to explore a regulatory factor of the expression of said gene.

[0104] The compound obtained by the screening can be used for developing pharmaceutical and diagnostic
medicines for the diseases with which the protein of the present invention is associated. Similarly, if the regulatory factor
obtained by the screening is a protein, the protein itself can be used as a pharmaceutical, and a compound
that affects the original expression level and/or activity of that protein can also be used for the same purpose.
[0105] If the protein of the invention has an enzymatic activity, regardless of whether it is a secretory protein, a
membrane protein, or a protein associated with signal transduction, glycoprotein, transcription, or diseases, a screening may
be performed by adding a compound to the protein of the invention under suitable conditions and monitoring the
change of the compound. The enzymatic activity may also be utilized to screen for a compound that can inhibit the
activity of the protein.

[0106] In one example of such screening, the protein of the invention is expressed and the recombinant protein is
purified. Then, compounds are contacted with the purified protein, and the amounts of the compound and the reaction
products are examined. Alternatively, the purified protein is pretreated with compounds that are candidate inhibitors,
then a compound (substrate) that can react with the purified protein is added, and the amounts of the substrate and the
reaction products are examined.

[0107] The compounds obtained in the screening may be used as medicines for diseases with which the protein of
the invention is associated. They can also be applied to tests that examine whether the protein of the invention functions
normally in vivo.

[0108] Whether or not the secretory or membrane protein of the present invention is a novel protein associated with
diseases can also be determined by a method other than those described above, namely by obtaining a specific antibody
against the protein of the invention and examining the relationship between the expression or activity of the protein and
a certain disease. Alternatively, it may be analyzed with reference to the methods in "Molecular Diagnosis of Genetic
Diseases" (Elles R. (ed.), (1996), in the series "Method in Molecular Biology", Humana Press).

[0109] Disease-associated proteins are targets of the above-described screenings and are very useful for developing
a drug that is capable of regulating the expression and activity of the protein. They are also useful in the pharmaceutical
industry as diagnostic markers of the related disease and as targets for gene therapy.

[0110] Compounds isolated as mentioned above can be administered to patients as they are, or after being formulated
into a pharmaceutical composition according to known methods. For example, a pharmaceutically acceptable carrier or
vehicle, specifically sterilized water, saline, plant oil, emulsifier, or suspending agent, can be mixed with the compounds
appropriately. The pharmaceutical compositions can be administered to patients by a method known to those skilled
in the art, such as intraarterial, intravenous, or subcutaneous injection. The dosage may vary depending on the weight
or age of the patient, or the method of administration, but those skilled in the art can choose an appropriate dosage.
If the compound is encoded by DNA, the DNA can be cloned into a vector for gene therapy and used for
gene therapy. The dosage of the DNA and the method of its administration may vary depending on the weight or age
of the patient, or the symptoms, but those skilled in the art can choose them appropriately.

[0111] The protein encoded by the polynucleotide of the invention can be prepared as a recombinant protein or as
a natural protein. For example, the recombinant protein can be prepared by inserting the polynucleotide encoding the
protein of the invention into a vector, introducing the vector into an appropriate host cell, and purifying the protein
expressed within the transformed host cell, as described below. In contrast, the natural protein can be prepared, for
example, by utilizing an affinity column to which an antibody against the protein of the invention (Current Protocols in
Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 16.1-16.19) is attached. The antibody used
for the affinity purification may be either a polyclonal antibody or a monoclonal antibody. Alternatively, in vitro translation
(see, for example, "On the fidelity of mRNA translation in the nuclease-treated rabbit reticulocyte lysate system." Dasso
M.C. and Jackson R.J. (1989) Nucleic Acids Res. 17: 3129-3144) may be used for preparing the protein of the invention.
[0112] Proteins functionally equivalent to the proteins of the present invention can be prepared based on the activities,
which were clarified in the above-mentioned manner, of the proteins of the present invention. Using the biological
activity possessed by the protein of the invention as an index, it is possible to verify whether or not a particular protein
is functionally equivalent to the protein of the invention by examining whether or not the protein has said activity.
[0113] Proteins functionally equivalent to the proteins of the present invention can be prepared by those skilled in
the art, for example, by using a method for introducing mutations into an amino acid sequence of a protein (for example,
site-directed mutagenesis (Current Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons,
Section 8.1-8.5)). Such proteins can also be generated by spontaneous mutations. The present invention comprises
the proteins having one or more amino acid substitutions, deletions, insertions and/or additions in the amino acid
sequences of the proteins of the present invention (Table 370), as long as the proteins have functions equivalent to
those of the proteins identified in the present Examples described later.

[0114] There are no limitations on the number and sites of amino acid mutations, as long as the proteins maintain their
functions. The number of mutations is typically 30% or less, or 20% or less, or 10% or less, preferably
5% or less, or 3% or less of the total amino acids, more preferably 2% or less or 1% or less of the total amino
acids. From the viewpoint of maintaining the protein function, it is preferable that a substituted amino acid has a similar
property to that of the original amino acid. For example, Ala, Val, Leu, Ile, Pro, Met, Phe and Trp are assumed to have
similar properties to one another because they are all classified into a group of non-polar amino acids. Similarly,
substitution can be performed among non-charged amino acids such as Gly, Ser, Thr, Cys, Tyr, Asn, and Gln, acidic amino
acids such as Asp and Glu, and basic amino acids such as Lys, Arg, and His.
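
The amino acid groupings and the mutation percentage discussed above can be illustrated with a short sketch. It is only an illustration under assumptions: the four groups are taken from the text, but the function names are hypothetical, and the fraction is computed for substitutions between sequences of equal length only (deletions, insertions and additions are not handled).

```python
# Amino acid groups as listed in the text (three-letter codes).
GROUPS = {
    "non-polar": {"Ala", "Val", "Leu", "Ile", "Pro", "Met", "Phe", "Trp"},
    "non-charged": {"Gly", "Ser", "Thr", "Cys", "Tyr", "Asn", "Gln"},
    "acidic": {"Asp", "Glu"},
    "basic": {"Lys", "Arg", "His"},
}


def group_of(residue):
    """Return the name of the group that contains the residue."""
    for name, members in GROUPS.items():
        if residue in members:
            return name
    raise ValueError(f"unknown residue: {residue}")


def is_conservative(original, substituted):
    """True if both residues belong to the same group."""
    return group_of(original) == group_of(substituted)


def substitution_fraction(original_seq, variant_seq):
    """Fraction of positions that differ between two equal-length sequences."""
    if len(original_seq) != len(variant_seq):
        raise ValueError("this sketch handles substitutions only")
    differing = sum(1 for o, v in zip(original_seq, variant_seq) if o != v)
    return differing / len(original_seq)


# Hypothetical 100-residue sequence with three substitutions.
original = ["Ala", "Leu", "Asp", "Lys"] * 25
variant = list(original)
variant[0], variant[2], variant[3] = "Val", "Glu", "Arg"
print(substitution_fraction(original, variant))
print(all(is_conservative(o, v) for o, v in zip(original, variant)))
```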

[0115] In addition, proteins functionally equivalent to the proteins of the present invention can be isolated by using
techniques of hybridization or gene amplification known to those skilled in the art. Specifically, using the hybridization
technique (Current Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons, Section 6.3-6.4),
those skilled in the art can usually isolate a DNA highly homologous to the DNA encoding the protein identified in the
present Example based on the identified nucleotide sequence (Table 370) or a portion thereof and obtain the functionally
equivalent protein from the isolated DNA. The present invention includes proteins encoded by the DNAs hybridizing
with the DNAs encoding the proteins identified in the present Example, as long as the proteins are functionally equivalent
to the proteins identified in the present Example. Organisms from which the functionally equivalent proteins are isolated
are illustrated by vertebrates such as human, mouse, rat, rabbit, pig and bovine, but are not limited to these animals.
[0116] Washing conditions of hybridization for the isolation of DNAs encoding the functionally equivalent proteins
are usually "1xSSC, 0.1% SDS, 37°C"; more stringent conditions are "0.5xSSC, 0.1% SDS, 42°C"; and still more stringent
conditions are "0.1xSSC, 0.1% SDS, 65°C". Alternatively, the following conditions can be given as hybridization
conditions of the present invention. Namely, conditions in which the hybridization is done at "6xSSC, 40% Formamide,
25°C", and the washing at "1xSSC, 55°C" can be given. More preferable conditions are those in which the hybridization
is done at "6xSSC, 40% Formamide, 37°C", and the washing at "0.2xSSC, 55°C". Even more preferable are those in which
the hybridization is done at "6xSSC, 50% Formamide, 37°C", and the washing at "0.1xSSC, 62°C". The more stringent
the conditions of hybridization are, the more frequently DNAs highly homologous to the probe sequence are isolated.
Therefore, it is preferable to conduct hybridization under stringent conditions. Examples of stringent conditions in the
present invention are washing conditions of "0.5xSSC, 0.1% SDS, 42°C", or alternatively, hybridization conditions of
"6xSSC, 40% Formamide, 37°C", and the washing at "0.2xSSC, 55°C". However, the above-mentioned combinations of
SSC, SDS and temperature conditions are indicated just as examples. Those skilled in the art can select
hybridization conditions with similar stringency to those mentioned above by properly combining the above-mentioned or
other factors (for example, probe concentration, probe length and duration of hybridization reaction) that determine
the stringency of hybridization.

[0117] The amino acid sequences of proteins isolated by using the hybridization techniques usually exhibit high
homology to those of the proteins of the present invention, which are shown in Table 370. The present invention
encompasses a polynucleotide comprising a nucleotide sequence that has a high identity to the nucleotide sequence of
claim 8 (a). Furthermore, the present invention encompasses a peptide or protein comprising an amino acid sequence
that has a high identity to the amino acid sequence encoded by the polynucleotide of claim 8 (b). The term "high identity"
indicates sequence identity of at least 40%; preferably 60% or more; and more preferably 70% or more.
Alternatively, more preferable is identity of 90% or more, or 93% or more, or 95% or more, furthermore, 97% or more,
or 99% or more. The identity can be determined by using the BLAST search algorithm.
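
As a simplified illustration of the identity thresholds listed above, the following Python sketch computes percent identity over two sequences that have already been aligned (with "-" marking gaps) and reports the highest listed threshold that is met. It is an assumption-laden sketch, not the BLAST algorithm itself: the function names are hypothetical, and a column containing a gap is simply counted as a mismatch.

```python
# Identity thresholds (in percent) named in the text, in ascending order.
THRESHOLDS = [40, 60, 70, 90, 93, 95, 97, 99]


def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity):
    """Return the largest listed threshold not exceeding the identity, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None
```

For example, two aligned 10-residue sequences differing at one position have 90% identity, which meets the "90% or more" level but not the "93% or more" level.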

[0118] With the gene amplification technique (PCR) (Current Protocols in Molecular Biology, edit. Ausubel et al.,
(1987) John Wiley & Sons, Section 6.3-6.4) using primers designed based on the nucleotide sequence (Table 370)
or a portion thereof identified in the present Example, it is possible to isolate a DNA fragment highly homologous to
the nucleotide sequence or a portion thereof and to obtain a protein functionally equivalent to a particular protein identified
in the present Example based on the isolated DNA fragment.

[0119] The "percent identity" of two amino acid sequences or of two nucleic acids is determined using the algorithm
of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87:2264-2268, 1990), modified as in Karlin and Altschul (Proc. Natl.
Acad. Sci. USA 90:5873-5877, 1993). Such an algorithm is incorporated into the BLASTN and BLASTX programs of
Altschul et al. (J. Mol. Biol. 215:403-410, 1990). BLAST nucleotide searches are performed with the BLASTN program,
score = 100, wordlength = 12. BLAST protein searches are performed with the BLASTX program, score = 50,
wordlength = 3. When gaps exist between two sequences, Gapped BLAST is utilized as described in Altschul et al. (Nucleic
Acids Res. 25:3389-3402, 1997). When utilizing BLAST and Gapped BLAST programs, the default parameters of the
respective programs (e.g., BLASTX and BLASTN) are used. See http://www.ncbi.nlm.nih.gov.
[0120] The present invention also includes partial peptides of the proteins of the invention. The partial peptides
include a protein generated by removal of a signal peptide from a secretory protein. If the
protein of the present invention has an activity as a receptor or a ligand, the partial peptide may function as a competitive
inhibitor of the protein and may bind to the receptor (or ligand). In addition, the present invention includes an antigen
peptide for raising antibodies. For the peptides to be specific for the protein of the invention, the peptides comprise at
least 7 amino acids, preferably 8 amino acids or more, more preferably 9 amino acids or more, and even more preferably
10 amino acids or more. The peptide can be used for preparing antibodies against the protein of the invention, or
competitive inhibitors of them, and also for screening for a receptor that binds to the protein of the invention. The partial
peptides of the invention can be produced, for example, by genetic engineering methods, known methods for
synthesizing peptides, or digesting the protein of the invention with an appropriate peptidase.

[0121] The present invention also relates to a vector into which the DNA of the invention is inserted. The vector of
the invention is not limited as long as it contains the inserted DNA stably. For example, if E. coli is used as a host,
vectors such as pBluescript vector (Stratagene) are preferable as a cloning vector. To produce the protein of the
invention, expression vectors are especially useful. Any expression vector can be used as long as it is capable of expressing
the protein in vitro, in E. coli, in cultured cells, or in vivo. For example, pBEST vector (Promega) is preferable for in
vitro expression, pET vector (Invitrogen) for E. coli, pME18S-FL3 vector (GenBank Accession No. AB009864) for
cultured cells, and pME18S vector (Mol. Cell. Biol. (1988) 8: 466-472) for in vivo expression. To insert the DNA of the
invention, ligation utilizing restriction sites can be performed according to the standard method (Current Protocols in
Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11.4-11.11).

[0122] The present invention also relates to a transformant carrying the vector of the invention. Any cell can be used
as a host into which the vector of the invention is inserted, and various kinds of host cells can be used depending on
the purposes. For strong expression of the protein in eukaryotic cells, COS cells or CHO cells can be used, for example.
[0123] Introduction of the vector into host cells can be performed, for example, by calcium phosphate precipitation
method, electroporation method (Current Protocols in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons,
Section 9.1-9.9), lipofectamine method (GIBCO-BRL), or microinjection method, etc.






[0124] The primer of the present invention can be used for synthesizing full-length cDNA, and also for the detection
and/or diagnosis of the abnormality of the protein of the invention encoded by the full-length cDNA. For example, by
utilizing polymerase chain reaction (genomic DNA-PCR, or RT-PCR) using the primer of the invention, DNA encoding
the protein of the invention can be amplified. It is also possible to obtain the regulatory region of expression in the
5'-upstream by using PCR or hybridization since the transcription start site within the genomic sequence can be easily
specified based on the 5'-end sequence of the full-length cDNA. The obtained genomic region can be used for detection
and/or diagnosis of the abnormality of the sequence by RFLP analysis, SSCP, or direct sequencing.
[0125] Furthermore, the "polynucleotide having a length of at least 15 nucleotides, comprising a nucleotide sequence
that is complementary to a polynucleotide comprising the nucleotide sequence set forth in any one of SEQ ID NOs in
Table 370, or its complementary strand" includes an antisense polynucleotide for suppressing the expression of the
protein of the invention. To exert the antisense effect, the antisense polynucleotide has a length of at least 15 bp or
more, for example, 50 bp or more, preferably 100 bp or more, and more preferably 500 bp or more, and has a length
of usually 3000 bp or less and preferably 2000 bp or less. The antisense DNA can be used in the gene therapy of the
diseases that are caused by the abnormality of the protein of the invention (abnormal function or abnormal expression).
Said antisense DNA can be prepared, for example, by the phosphorothioate method ("Physicochemical properties of
phosphorothioate oligodeoxynucleotides." Stein (1988) Nucleic Acids Res. 16: 3209-3221) based on the nucleotide
sequence of the DNA encoding the protein (for example, the DNA set forth in any one of SEQ ID NOs in Table 370).
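
The relationship between a sense sequence and an antisense polynucleotide of the stated length range can be illustrated with the following Python sketch. It is a minimal example under stated assumptions: the function names are hypothetical, only unmodified A/C/G/T bases are handled, and the default limits are the 15-nucleotide minimum and the usual 3000-nucleotide maximum given in the text.

```python
# Translation table mapping each base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_candidate(sense, start, length, min_len=15, max_len=3000):
    """Return the antisense sequence for sense[start:start + length].

    Raises ValueError when the requested length is outside the stated
    range or the region runs past the end of the sense sequence.
    """
    if not min_len <= length <= max_len:
        raise ValueError("length outside the permitted range")
    target = sense[start:start + length]
    if len(target) < length:
        raise ValueError("region extends past the end of the sequence")
    return reverse_complement(target)
```

Taking the reverse complement of the returned sequence recovers the targeted region of the sense strand, which is a convenient self-check.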
[0126] The polynucleotide or antisense DNA of the present invention can be used in gene therapy, for example, by
administering it to a patient by the in vivo or ex vivo method with virus vectors such as retrovirus vectors, adenovirus
vectors, and adeno-associated virus vectors, or non-virus vectors such as liposomes.

[0127] The present invention also relates to antibodies that bind to the protein of the invention. There are no limitations
in the form of the antibodies of the invention. They include polyclonal antibodies, monoclonal antibodies, or their portions 
that can bind to the protein of the invention. They also include antibodies of all classes. Furthermore, special antibodies 
such as humanized antibodies are also included. 

[0128] The polyclonal antibody of the invention can be obtained according to the standard method by synthesizing
an oligopeptide corresponding to the amino acid sequence and immunizing rabbits with the peptide (Current Protocols
in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11.12-11.13). The monoclonal antibody of
the invention can be obtained according to the standard method by purifying the protein expressed in E. coli, immunizing
mice with the protein, and producing a hybridoma cell by fusing the spleen cells and myeloma cells (Current Protocols
in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11.4-11.11).

[0129] The antibody binding to the protein of the present invention can be used for purification of the protein of the 
invention, and also for detection and/or diagnosis of the abnormalities of the expression and structure of the protein. 
Specifically, proteins can be extracted, for example, from tissues, blood, or cells, and the protein of the invention is 
detected by Western blotting, immunoprecipitation, or ELISA, etc. for the above purpose. 

[0130] Furthermore, the antibody binding to the protein of the present invention can be utilized for treating the
diseases associated with the protein of the invention. If the antibodies are used for treating patients, human antibodies
or humanized antibodies are preferable in terms of their low antigenicity. The human antibodies can be prepared by
immunizing a mouse whose immune system is replaced with that of human ("Functional transplant of megabase human
immunoglobulin loci recapitulates human antibody response in mice" Mendez M.J. et al. (1997) Nat. Genet. 15:
146-156). The humanized antibodies can be prepared by recombination of the hypervariable region of a monoclonal
antibody (Methods in Enzymology (1991) 203: 99-121).

[0131] The present invention further relates to databases, or a medium on which such databases are recorded, comprising
at least one polynucleotide and/or protein sequence selected from the sequence data of the nucleotides and/or the amino
acids indicated in Table 370. The term "database" means a set of accumulated information in which nucleotide sequences
are stored as machine-searchable and machine-readable information. The databases of the present invention comprise at least one of the
novel nucleotide sequences of polynucleotide provided by the present invention. The databases of the present invention
can consist of only the sequence data of the polynucleotide provided by the present invention or can comprise other
information on nucleotide sequences of known full-length cDNAs or ESTs. The databases of the present invention can
comprise not only the information on the nucleotide sequences but also the information on the gene functions
revealed by the present invention. Additional information such as names of DNA clones carrying the full-length cDNAs
can be recorded or linked together with the sequence data in the databases.

[0132] The database of the present invention is useful for gaining complete gene sequence information from partial
sequence information of a gene of interest. The database of the present invention comprises nucleotide sequence
information of full-length cDNAs. Consequently, by comparing the information in this database with the nucleotide
sequence of a partial gene fragment yielded by differential display method or subtraction method, the information on
the full-length nucleotide sequence of interest can be gained from the sequence of the partial fragment as a starting clue.
[0133] The sequence information of the full-length cDNAs constituting the database of the present invention contains
not only the information on the complete sequences but also extra information on expression frequency of the genes
as well as homology of the genes to known genes and known proteins. Thus the extra information facilitates rapid
functional analyses of partial gene fragments. Further, the information on human genes is accumulated in the database
of the present invention, and therefore, the database is useful for isolating a human homologue of a gene originating
from other species. The human homologue can be isolated based on the nucleotide sequence of the gene from the
original species.

[0134] At present, information on a wide variety of gene fragments can be obtained by differential display method 
and subtraction method. In general, these gene fragments are utilized as tools for isolating the full-length sequences 
thereof. When the gene fragment corresponds to an already-known gene, the full-length sequence is easily obtained 
by comparing the partial sequence with the information in known databases. However, when there exists no information
corresponding to the partial sequence of interest in the known databases, cDNA cloning should be carried out for the
full-length cDNA. It is often difficult to obtain the full-length nucleotide sequence using the partial sequence information 
as an initial clue. If the full-length of the gene is not available, the amino acid sequence of the protein encoded by the 
gene remains unidentified. Thus the database of the present invention can contribute to the identification of full-length 
cDNAs corresponding to gene fragments, which cannot be revealed by using databases of known genes. 

[0135] The invention is illustrated more specifically with reference to the following examples, but is not to be construed
as being limited thereto. 

EXAMPLE 1 

Construction of a cDNA library by the oligo-capping method.

[0136] The NT-2 neuron progenitor cells (Stratagene), a teratocarcinoma cell line from human embryo testis, which
can differentiate into neurons by treatment with retinoic acid, were used. The NT-2 cells were cultured according to the
manufacturer's instructions as follows.

(1) NT-2 cells were cultured without induction by retinoic acid treatment (NT2RM1, NT2RM2, NT2RM4).

(2) After culturing, NT-2 cells were induced by adding retinoic acid, and then were cultured for 48 hours (NT2RP1).

(3) After culturing, NT-2 cells were induced by adding retinoic acid, and then were cultured for 2 weeks (NT2RP2,
NT2RP3, NT2RP4).

[0137] Also, the human brain neuroglioma cell line H4 (ATCC HTB-148) (BNGH41), human neuroblastoma cell line
SK-N-MC (ATCC HTB-10) (SKNMC1), and human retinoblastoma cell line Y79 (ATCC HTB-18) (Y79AA1) were cultured
according to the culture conditions described in the ATCC catalogue. The cells were harvested separately, and mRNA
was extracted from each cell by the method described in the literature (Molecular Cloning 2nd edition. Sambrook J.,
Fritsch, E.F., and Maniatis T. (1989) Cold Spring Harbor Laboratory Press). Furthermore, poly(A)+ RNA was purified
from the mRNA using oligo-dT cellulose.

[0138] Similarly, human placenta (PLACE1, PLACE2, PLACE3), human ovary cancer tissue (OVARC1), tissues from
human embryo at 10 weeks, which is enriched with head (HEMBA1), or body (HEMBB1), human mammary gland
(MAMMA1), human thyroid gland (THYRO1) were used to extract mRNA by the method described in the literature
(Molecular Cloning 2nd edition. Sambrook J., Fritsch, E.F., and Maniatis T. (1989) Cold Spring Harbor Laboratory
Press). Furthermore, poly(A)+ RNA was purified from the mRNA using oligo-dT cellulose.

[0139] Each poly(A)+ RNA was used to construct a cDNA library by the oligo-capping method (Maruyama M. and
Sugano S. (1994) Gene 138: 171-174). Using the Oligo-cap linker (SEQ ID NO: 2541) and the Oligo-dT primer (SEQ
ID NO: 2542), bacterial alkaline phosphatase (BAP) treatment, tobacco acid pyrophosphatase (TAP) treatment, RNA
ligation, the first strand cDNA synthesis, and removal of RNA were performed as described in the reference (Suzuki and
Kanno (1996) Protein Nucleic acid and Enzyme. 41: 197-201; Suzuki Y. et al. (1997) Gene 200: 149-156). Next, 5'-
and 3'-PCR primers (SEQ ID NO: 2543, and 2544, respectively) were used for performing PCR to convert the cDNA
into double stranded cDNA, which was then digested with SfiI. Then, the DraIII-cleaved pUC19FL3 vector (Figure 1;
for NT2RM1, and NT2RP1), or the DraIII-cleaved pME18SFL3 (Figure 1) (GenBank AB009864, expression vector; for
NT2RM2, NT2RM4, NT2RP2, NT2RP3, NT2RP4, BNGH41, SKNMC1, Y79AA1, PLACE1, PLACE2, PLACE3,
OVARC1, HEMBA1, HEMBB1, MAMMA1, and THYRO1) was used for cloning the cDNA in a unidirectional manner,
and cDNA libraries were obtained. Then, the nucleotide sequence of the 5'- and 3'-ends of the cDNA clones was
analyzed with a DNA sequencer (ABI PRISM 377, PE Biosystems) after sequencing reactions were performed with
the DNA sequencing reagents (Dye Terminator Cycle Sequencing FS Ready Reaction Kit, dRhodamine Terminator
Cycle Sequencing FS Ready Reaction Kit, or BigDye Terminator Cycle Sequencing FS Ready Reaction Kit, from
PE Biosystems) according to the instructions. The data were compiled into a database.

[0140] The full-length-enriched cDNA libraries except those for NT2RM1 and NT2RP1 were constructed using the
eukaryotic expression vector pME18SFL3. The vector contains the SRα promoter and SV40 small t intron upstream
of the cloning site, and the SV40 polyA addition signal sequence downstream. As the cloning site of pME18SFL3
has asymmetrical DraIII sites, and the ends of cDNA fragments contain SfiI sites complementary to the DraIII sites,
the cloned cDNA fragments can be inserted downstream of the SRα promoter unidirectionally. Therefore,
clones containing full-length cDNA can be expressed transiently by introducing the obtained plasmid directly into COS
cells. Thus, the clones can be analyzed very easily in terms of the proteins that are the gene products of the clones,
or in terms of the biological

[0141] Herein, the cDNA libraries and the name of each clone are related as shown in Table 2. Therein, "xxxxxx" 
represents the clone number of six digits. Thus, the sequences are named by the library name, the clone number plus 
F- for the 5'-end, or R- for the 3'-end. 
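
The naming rule described above can be sketched in Python as follows. The function names are hypothetical, and the sketch assumes that the clone number is always written as exactly six digits. It does not model the fact that Table 2 lists no 3'-end sequence name for the NT2RM1 and NT2RP1 libraries.

```python
import re


def end_sequence_names(library, clone_number):
    """Build the clone name and the 5'-end / 3'-end sequence names."""
    if not 0 <= clone_number <= 999999:
        raise ValueError("clone number must fit in six digits")
    clone = f"{library}{clone_number:06d}"
    return {"clone": clone, "5'-end": f"F-{clone}", "3'-end": f"R-{clone}"}


# Optional F-/R- prefix, library name, then exactly six trailing digits.
_NAME = re.compile(r"^(?:([FR])-)?([A-Z0-9]+?)(\d{6})$")


def parse_name(name):
    """Split a name into (end, library, clone number); end is None for a clone name."""
    match = _NAME.match(name)
    if match is None:
        raise ValueError(f"unrecognized name: {name}")
    prefix, library, number = match.groups()
    end = {"F": "5'-end", "R": "3'-end", None: None}[prefix]
    return end, library, int(number)
```

For instance, clone number 1234 of library HEMBA1 would be named HEMBA1001234, with F-HEMBA1001234 and R-HEMBA1001234 as its end sequences.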

Table 2

library:   clone           5'-end sequence    3'-end sequence
NT2RM1:    NT2RM1xxxxxx    F-NT2RM1xxxxxx
NT2RP1:    NT2RP1xxxxxx    F-NT2RP1xxxxxx
NT2RM2:    NT2RM2xxxxxx    F-NT2RM2xxxxxx     R-NT2RM2xxxxxx
NT2RM4:    NT2RM4xxxxxx    F-NT2RM4xxxxxx     R-NT2RM4xxxxxx
NT2RP2:    NT2RP2xxxxxx    F-NT2RP2xxxxxx     R-NT2RP2xxxxxx
NT2RP3:    NT2RP3xxxxxx    F-NT2RP3xxxxxx     R-NT2RP3xxxxxx
NT2RP4:    NT2RP4xxxxxx    F-NT2RP4xxxxxx     R-NT2RP4xxxxxx
BNGH41:    BNGH41xxxxxx    F-BNGH41xxxxxx     R-BNGH41xxxxxx
SKNMC1:    SKNMC1xxxxxx    F-SKNMC1xxxxxx     R-SKNMC1xxxxxx
Y79AA1:    Y79AA1xxxxxx    F-Y79AA1xxxxxx     R-Y79AA1xxxxxx
PLACE1:    PLACE1xxxxxx    F-PLACE1xxxxxx     R-PLACE1xxxxxx
PLACE2:    PLACE2xxxxxx    F-PLACE2xxxxxx     R-PLACE2xxxxxx
PLACE3:    PLACE3xxxxxx    F-PLACE3xxxxxx     R-PLACE3xxxxxx
OVARC1:    OVARC1xxxxxx    F-OVARC1xxxxxx     R-OVARC1xxxxxx
HEMBA1:    HEMBA1xxxxxx    F-HEMBA1xxxxxx     R-HEMBA1xxxxxx
HEMBB1:    HEMBB1xxxxxx    F-HEMBB1xxxxxx     R-HEMBB1xxxxxx
MAMMA1:    MAMMA1xxxxxx    F-MAMMA1xxxxxx     R-MAMMA1xxxxxx
THYRO1:    THYRO1xxxxxx    F-THYRO1xxxxxx     R-THYRO1xxxxxx

EXAMPLE 2 

Estimation of the fullness ratio of the 5'-ends of the clones contained in the cDNA libraries constructed by the oligo-capping method.

[0142] The fullness ratio at the 5'-end sequences of the 59,823 clones in the human cDNA libraries constructed by
the oligo-capping method was determined as follows. Of all the clones whose 5'-end sequences were found in those
of known human mRNA in the public database, a clone was judged to be "full-length", if it had a longer 5'-end sequence
than that of the known human mRNA, or, even though the 5'-end sequence was shorter, if it contained the translation
initiation codon. A clone which did not contain the translation initiation codon was judged to be "non-full-length". The
fullness ratio ((the number of full-length clones)/(the number of full-length and non-full-length clones)) at the 5'-end of
the cDNA clones from each library was determined by comparing with the known human mRNA. As a result, the fullness
ratio of the 5'-ends was 63.5%. It suggests that the human cDNA clones obtained by the described method have
complete 5'-ends with high probability.
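
The judgement rule and the fullness ratio described in this Example can be sketched in Python as follows. The CloneMatch record and its field names are hypothetical stand-ins for the outcome of comparing a clone's 5'-end sequence with a known human mRNA; the sketch only encodes the rule as stated in the text.

```python
from dataclasses import dataclass


@dataclass
class CloneMatch:
    """Hypothetical summary of one clone compared with a known mRNA."""
    extends_known_5prime: bool  # 5'-end is longer than that of the known mRNA
    contains_start_codon: bool  # 5'-end sequence contains the translation initiation codon


def is_full_length(match):
    """A clone is full-length if its 5'-end is longer or it still contains the start codon."""
    return match.extends_known_5prime or match.contains_start_codon


def fullness_ratio(matches):
    """(number of full-length clones) / (number of full-length and non-full-length clones)."""
    if not matches:
        raise ValueError("at least one clone is required")
    full = sum(1 for m in matches if is_full_length(m))
    return full / len(matches)
```

With three full-length clones among four compared clones, this sketch gives a fullness ratio of 0.75.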


EXAMPLE 3 

Assessment of the fullness ratio of the 5'-end of the cDNA by the ATGpr and the ESTiMateFL. 

[0143] The ATGpr, developed by Salamov A.A., Nishikawa T., and Swindells M.B. in the Helix Research Institute, is
a program for prediction of the translation start codon based on the characteristics of the sequences in the vicinity of
the ATG codon. The results are shown as expectations that an ATG is a true start codon (0.05-0.94). When this
program is applied to general cDNAs without considering whether or not the ATG codons in the cDNAs are the true
initiation codons of the cDNAs, both the sensitivity and the specificity of the results are estimated at 66%. Here, the
sensitivity means the ratio of the number of codons judged to be initiation codons by the program to the total number
of true initiation codons, and the specificity means the ratio of the number of true initiation codons to the number of
codons judged to be initiation codons by the program. In contrast, when the program was applied to the 5'-end
sequences of the clones from the cDNA library that was obtained by the oligo-capping method and that had 65% fullness
ratio, the sensitivity and specificity of evaluation of a full-length clone (clone containing the N-terminal end of ORF)
were improved to 82-83% by selecting only clones having the ATGpr1 score 0.6 or higher.
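
The two measures defined above can be illustrated with a short Python sketch. It assumes that the numerator of both ratios is the number of true initiation codons that the program also judged to be initiation codons; under that reading, the "specificity" of the text corresponds to what is nowadays usually called precision. The function name and the use of codon identifiers in sets are hypothetical.

```python
def sensitivity_and_specificity(predicted, actual):
    """Return (sensitivity, specificity) as defined in the text.

    predicted: set of codon identifiers judged to be initiation codons
    actual:    set of codon identifiers that are true initiation codons
    """
    if not predicted or not actual:
        raise ValueError("both sets must be non-empty")
    correctly_judged = len(predicted & actual)
    sensitivity = correctly_judged / len(actual)
    specificity = correctly_judged / len(predicted)
    return sensitivity, specificity
```

For example, if the program judges four codons to be initiation codons and three of them are among six true initiation codons, the sensitivity is 0.5 and the specificity is 0.75.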

[0144] Furthermore, the program was used to assess the fullness of 18,959 clones in the human cDNA libraries
obtained here, which have 5'-ends matched to a known human mRNA. Briefly, the maximal ATGpr1 score of the clones
was determined, and then their 5'-end sequence was compared with the known human mRNA to estimate whether
the clone is full-length or not. The result was summarized in Table 3. Because known mRNAs are, in
general, highly expressed in the cell, the clones with a low number of EST hits, which
represent mRNAs whose expression levels are relatively low, were also examined, and the result is shown in Table 4.
[0145] In the tables, the number of full-length clones indicates the number of clones containing the N-terminal end of the ORF,
and the number of non-full-length clones indicates the number of clones without the N-terminal end of the ORF. The fullness ratio
represents (the number of full-length clones)/(the number of full-length clones plus the number of non-full-length
clones).



Table 3

The maximal ATGpr1 score and the fullness ratio of the 5'-end sequences of clones obtained from human cDNA
libraries constructed by the oligo-capping method; clones having a matched 5'-end with that of a known human
mRNA.

maximal ATGpr1 score    number of (full-length clones     number of full-length    fullness ratio
                        plus non-full-length clones)      clones
>=0.70                  11,193                            9,346                    83.5%
>=0.50                  13,369                            10,549                   78.9%
>=0.30                  15,489                            11,340                   73.2%
>=0.15                  17,394                            11,811                   67.9%
>=0.00                  18,959                            12,046                   63.5%
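
The fullness ratios in Table 3 follow directly from the two clone counts in each row. The short Python sketch below recomputes them; the counts are copied from Table 3, while the variable and function names are hypothetical.

```python
# Rows of Table 3: (minimum maximal ATGpr1 score, total clones, full-length clones).
TABLE_3 = [
    (0.70, 11193, 9346),
    (0.50, 13369, 10549),
    (0.30, 15489, 11340),
    (0.15, 17394, 11811),
    (0.00, 18959, 12046),
]


def fullness_percent(total, full_length):
    """Fullness ratio as a percentage rounded to one decimal place."""
    return round(100.0 * full_length / total, 1)


for threshold, total, full_length in TABLE_3:
    print(f">={threshold:.2f}  {fullness_percent(total, full_length)}%")
```

The computed percentages decrease as the score threshold is lowered, which is the trend the table is meant to show.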



Table 4

The maximal ATGpr1 score and the fullness ratio of the 5'-end sequences of the clones obtained from human cDNA
libraries constructed by the oligo-capping method; clones having 5 EST hits or less among the clones having a
matched 5'-end with that of a known human mRNA.

maximal ATGpr1 score    number of (full-length clones     number of full-length    fullness ratio
                        plus non-full-length clones)      clones
>=0.70                  2,801                             1,934                    69.0%
>=0.50                  3,683                             2,393                    65.0%
>=0.30                  4,683                             2,707                    57.8%
>=0.15                  5,559                             2,890                    52.0%
>=0.00                  6,113                             3,013                    49.8%



[0146] The ESTiMateFL, developed by Nishikawa and Ota in the Helix Research Institute, is a method for the
selection of a clone with high fullness ratio by comparing with the 5'-end or 3'-end sequences of ESTs in the public
database.

[0147] By the method, a cDNA clone is judged presumably not to be full-length if there exist any ESTs which have
longer 5'-end or 3'-end sequences than the clone. The method is systematized for high throughput analysis. A clone
is judged to be full-length if the clone has a longer 5'-end sequence than ESTs in the public database. Even if a clone
has a shorter 5'-end, the clone is judged to be full-length if the difference in length is within 50 bases, and otherwise
judged not to be full-length, for convenience. In the case of the 5'-end sequences of the clones which match a known
mRNA, about 80% of the sequences that were judged to be full-length by comparing with ESTs were also judged to be
full-length by estimating the 5'-end sequence; about 80% of the sequences that were judged to be not full-length
by comparing with ESTs were also judged to be not full-length by estimating the 5'-end sequence. The accuracy of
the prediction by comparing cDNA clones with ESTs is improved with increasing number of ESTs to be compared.
However, when only a limited number of ESTs are available, the reliability becomes low. Thus, the method is effective
in excluding clones with high probability of being non-full-length, from the cDNA clones that are synthesized by the
oligo-capping method and that have the 5'-end sequences with about 60% fullness ratio. In particular, the ESTiMateFL is
efficiently used to estimate the fullness ratio at the 3'-end sequence of cDNA of a human unknown mRNA which has
a significant number of ESTs in the public database.
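
The 5'-end decision rule described above can be sketched in Python as follows. This is not the ESTiMateFL program itself; the function name and the input representation are assumptions. Each entry of est_extensions is taken to be the number of bases by which a matching EST extends beyond the 5'-end of the clone, with a negative value meaning that the clone is the longer of the two.

```python
def judged_full_length_by_ests(est_extensions, tolerance=50):
    """Apply the 50-base rule to the ESTs that match a clone.

    Returns True when no EST extends beyond the clone's 5'-end by more
    than the tolerance. With no ESTs at all the result is also True,
    although the text notes that such a judgement is unreliable.
    """
    return all(extension <= tolerance for extension in est_extensions)
```

A clone is therefore rejected as soon as a single EST extends more than 50 bases beyond its 5'-end, which matches the use of the method for excluding likely non-full-length clones.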

[0148] The 18,959 clones isolated from human cDNA libraries constructed by the oligo-capping method, which have
the 5'-end sequence that matches a known human mRNA, were estimated by using the ATGpr and ESTiMateFL. Briefly,
the 5'-end sequence that matches a known human mRNA of the respective clone was analyzed to obtain the maximal
ATGpr1 score, and compared with the ORF of the known human mRNA that matches it to determine whether the clone
is full-length or not. Then, the 5'-end sequence of the respective clone was analyzed by the ESTiMateFL to judge
whether the clone is full-length or not. Specifically, the 5'-end sequences that match a known human mRNA of the
18,959 clones constructed by the oligo-capping method were compared with those of ESTs by the ESTiMateFL and
the clones other than those that are not full-length were selected. Then, the selected clones were used to analyze the
relationship between the ATGpr and the fullness ratio. The result was summarized in Table 5. Also, among the selected clones,
those in which the number of EST hits is not more than 5 were selected and analyzed. The result was summarized
in Table 6, which represents the result of the analysis of mRNA with relatively low abundance.
[0149] In the Tables, the number of full-length clones, the number of non-full-length clones, and the fullness ratio
indicate the number of the clones that contain the N-terminus of the ORF, the number of the clones that do not contain
the N-terminus of the ORF, and (the number of full-length clones)/((the number of full-length clones) plus (the number
of non-full-length clones)), respectively.



Table 5

The maximal ATGpr1 score and the fullness ratio of the 5'-end sequence in the clones isolated from human cDNA
libraries constructed by the oligo-capping method, which have the 5'-end sequence that matches a known human
mRNA, and also other than those being not full-length according to the comparison with ESTs.

maximal ATGpr1 score    number of (full-length clones     number of full-length    fullness ratio
                        plus non-full-length clones)      clones
>=0.70                  9,068                             8,349                    92.1%
>=0.50                  10,345                            9,318                    90.1%
>=0.30                  11,425                            9,964                    87.2%
>=0.15                  12,254                            10,335                   84.3%
>=0.00                  12,785                            10,484                   82.0%



Table 6

The maximal ATGpr1 score and the fullness ratio of the 5'-end sequence in the clones isolated from human cDNA
libraries constructed by the oligo-capping method, which have the 5'-end sequence that matches a known human
mRNA, and also other than those being not full-length according to the comparison with ESTs, in which the number
of the EST hit is not more than 5.

maximal ATGpr1 score    number of (full-length clones     number of full-length    fullness ratio
                        plus non-full-length clones)      clones
>=0.70                  1,959                             1,510                    77.1%
>=0.50                  2,469                             1,821                    73.8%
>=0.30                  2,975                             2,046                    68.8%
>=0.15                  3,368                             2,164                    64.3%
>=0.00                  3,661                             2,226                    60.8%



[0150] According to the above results, it was found that, when clones isolated from human cDNA libraries
constructed by the oligo-capping method are used, the fullness ratio of the clones that have a low score in the ATGpr can be
improved by assessing their 5'-end sequence using the combination of the ATGpr and the ESTiMateFL. Therefore, the
method was applied to select a cDNA clone with high fullness ratio.



EXAMPLE 4 

Clustering of the 5'-end and 3'-end sequences of cDNA clones.

[0151] The 5'-end and 3'-end sequences of cDNA clones were obtained, and clustered separately. Briefly, data of 
the single pass sequencing of the determined 5'-end and 3'-end of cDNA clones was subjected to the BLAST search 
between the sequence data of all the clones synthesized in Example 1 , and clones that are supposed to be originating 
from the same gene were clustered into a group. For the 5'-end sequence, those having the consensus sequence of 
95% identity 300 base pairs or more are clustered into the same group. For the 3'-end sequence, those having the 
consensus sequence of 90% identity 200 base pairs or more are clustered into the same group. Among the clusters 
of the 5'-end and 3-end sequences, the sequence having the longest lead was chosen as the representative sequence 
of the cluster (group). 
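The grouping step described above can be sketched as single-linkage clustering over pairwise BLAST hits. This is only an illustration under stated assumptions: the specification does not say how hits were chained into groups, so the union-find approach, the `Hit` record, and the treatment of the identity figures as lower bounds are all hypothetical choices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One pairwise BLAST hit between two clone end sequences (hypothetical record)."""
    query: str
    subject: str
    identity: float  # percent identity over the matching region
    length: int      # length of the matching region in base pairs


def cluster(clone_ids, hits, min_identity, min_length):
    """Group clones linked by any hit meeting both thresholds (single linkage)."""
    parent = {c: c for c in clone_ids}

    def find(x):
        # Follow parent links to the root, compressing the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for h in hits:
        if h.identity >= min_identity and h.length >= min_length:
            root_a, root_b = find(h.query), find(h.subject)
            if root_a != root_b:
                parent[root_b] = root_a

    groups = {}
    for c in clone_ids:
        groups.setdefault(find(c), []).append(c)
    return sorted(sorted(g) for g in groups.values())


def representative(group, read_lengths):
    """Pick the member with the longest read as the cluster representative."""
    return max(group, key=lambda c: read_lengths[c])


# Thresholds taken from the text: 5'-ends use 95% / 300 bp, 3'-ends use 90% / 200 bp.
FIVE_PRIME = dict(min_identity=95.0, min_length=300)
THREE_PRIME = dict(min_identity=90.0, min_length=200)
```

With made-up hits such as `Hit("a", "b", 96.0, 320)`, calling `cluster(ids, hits, **FIVE_PRIME)` would place `a` and `b` in one group, while a hit of only 250 base pairs would not link its two clones under the 5'-end thresholds.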

EXAMPLE 5 

Characterization of the representative sequences and the sequences of clones 

[0152] The 5'-end sequence data of the representative sequences and of the clones were characterized by the following methods:

(1) judging whether the sequence is identical to a human mRNA or EST by a BLAST search of GenBank or SwissProt, and examining whether it is full-length by comparison with the sequences of known human mRNAs and ESTs;

(2) determining the ATGpr1 score for all the initiation codons contained within the 5'-end sequence using ATGpr, which predicts the fullness ratio;

(3) predicting the existence of a signal sequence for all the initiation codons contained within the 5'-end sequence using PSORT, which predicts signal sequences; and

(4) for the 5'-end sequences of the representative sequences of the clusters only, examining the keywords in the top hit data of the homology search of SwissProt.

[0153] The data of the characterized representative sequences and clones were used for the final selection of the clones.


EXAMPLE 6 

Identity to the human mRNA and human EST, and comparison of the 5'-end length. 

[0154] A clone or a representative sequence of a cluster was judged to be identical to a human mRNA if its 5'-end sequence had a region of 200 nucleotides or longer with 94% or more identity to the mRNA. It was judged to be identical to a human EST if its 5'-end sequence had a region of 200 nucleotides or longer with 90% or more identity to the EST.

[0155] A clone or a representative sequence of a cluster was judged to be full-length in comparison with a human mRNA if its 5'-end sequence was longer than that of the mRNA, or if it contained the translation initiation site. It was judged to be full-length in comparison with a human EST in the database if its 5'-end sequence was longer than that of the EST or, for convenience, if it was shorter but the difference in length between the two sequences was 50 nucleotides or less. Otherwise, the clone or representative sequence was judged to be not full-length.
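The identity and full-length criteria of this example can be written as three small predicates. This is a sketch under assumptions: the function names and the signed `five_prime_offset` argument (how many nucleotides the clone's 5'-end extends beyond the database sequence, negative when it is shorter) are hypothetical conventions introduced here for illustration.

```python
# Minimum percent identity over a region of at least 200 nucleotides, per the text.
MIN_IDENTITY = {"mRNA": 94.0, "EST": 90.0}
MIN_REGION = 200
EST_TOLERANCE = 50  # a 5'-end up to 50 nt shorter than the EST still counts


def identical_to_database(identity_pct: float, region_len: int, kind: str) -> bool:
    """True if the 5'-end sequence is judged identical to a human mRNA or EST."""
    return region_len >= MIN_REGION and identity_pct >= MIN_IDENTITY[kind]


def full_length_vs_mrna(five_prime_offset: int, has_initiation_site: bool) -> bool:
    """Full-length if longer than the mRNA at the 5'-end or if the start site is present."""
    return five_prime_offset > 0 or has_initiation_site


def full_length_vs_est(five_prime_offset: int) -> bool:
    """Full-length if longer than the EST, or shorter by at most 50 nucleotides."""
    return five_prime_offset >= -EST_TOLERANCE
```

For example, a clone whose 5'-end is 50 nucleotides shorter than the matching EST would still pass `full_length_vs_est`, whereas one that is 51 nucleotides shorter would not.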


EXAMPLE 7 

Prediction of the fullness ratio by the ATGpr. 

[0156] The ATGpr1 score is a calculated expectation that a sequence is full-length, and a higher score reflects a higher fullness ratio, as shown in Example 3. The maximal ATGpr1 score is the highest of the scores obtained for all the initiation codons contained in the 5'-end sequence of a clone or representative sequence, and it was used for the characterization.

EXAMPLE 8

Prediction of the existence of a signal sequence by the PSORT 

[0157] PSORT was used to predict the existence of a signal sequence in all of the amino acid sequences predicted from all the initiation codons in the 5'-end sequences of the clones and the representative sequences of the clusters. By analyzing the presence or absence of a predicted signal sequence, which is characteristic of the N-terminus of many secretory proteins, cDNA clones encoding a secretory protein or membrane protein were selected.

EXAMPLE 9

Prediction of the protein function by the BLAST search. 

[0158] The 5'-end sequences of the representative sequences of the clusters were analyzed by a BLAST homology search of SwissProt. The top hit data obtained were classified into those identical to the 5'-end representative sequence (identity of 90% or higher), those not identical to the 5'-end representative sequence (identity of 60% or lower, with a compared sequence of not more than 25 nucleotides), and those similar to the 5'-end representative sequence (the rest of the data).
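The three-way classification of the top hit can be sketched as a single function. The function name and labels are illustrative assumptions; the thresholds are those stated in the paragraph above, with the "not identical" class requiring both conditions.

```python
def classify_top_hit(identity_pct: float, compared_len: int) -> str:
    """Classify a SwissProt top hit relative to the 5'-end representative sequence."""
    if identity_pct >= 90.0:
        return "identical"
    if identity_pct <= 60.0 and compared_len <= 25:
        return "not identical"
    # Everything else is treated as similar ("the rest of the data").
    return "similar"
```

Under this reading, a hit with 55% identity over a longer compared region falls into the "similar" class, because only one of the two "not identical" conditions is met.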
[0159] All the keywords in the SwissProt data corresponding to the top hit data were selected, and the 5'-end representative sequences were classified by the keywords relating to functions.

The keywords relating to a secretory protein or membrane protein are the following:

growth factor,
cytokine,
hormone,
receptor,
G-protein coupled receptor,
ionic channel,
voltage-gated channel,
calcium channel,
extracellular matrix,
transmembrane, and
signal.

[0160] The keyword relating to glycoprotein is glycoprotein.

[0161] The keywords relating to signal transduction are the following:

serine/threonine-protein kinase,
tyrosine-protein kinase, and
calmodulin-binding.

[0162] The keywords relating to transcription are the following:

transcription regulation and activator,
transcription regulation and repressor, and
nuclear protein and repressor.

[0163] The keywords relating to diseases are disease mutation and syndrome.

[0164] Many keywords overlapped within a group (receptor and transmembrane, for example), and some keywords overlapped between different groups (secretory or membrane, and diseases, etc.).
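The keyword grouping above can be represented as a lookup table, which also makes the overlaps between groups explicit. This is a sketch under assumptions: entries such as "transcription regulation and activator" are read here as requiring both keywords on the same record, matching is case-insensitive, and the names `KEYWORD_GROUPS` and `groups_for` are hypothetical.

```python
# Each group lists keyword combinations; a combination matches only if every
# keyword in it is present on the record (assumed reading of "X and Y").
KEYWORD_GROUPS = {
    "secretory or membrane": [
        {"growth factor"}, {"cytokine"}, {"hormone"}, {"receptor"},
        {"g-protein coupled receptor"}, {"ionic channel"},
        {"voltage-gated channel"}, {"calcium channel"},
        {"extracellular matrix"}, {"transmembrane"}, {"signal"},
    ],
    "glycoprotein": [{"glycoprotein"}],
    "signal transduction": [
        {"serine/threonine-protein kinase"},
        {"tyrosine-protein kinase"},
        {"calmodulin-binding"},
    ],
    "transcription": [
        {"transcription regulation", "activator"},
        {"transcription regulation", "repressor"},
        {"nuclear protein", "repressor"},
    ],
    "disease": [{"disease mutation"}, {"syndrome"}],
}


def groups_for(keywords):
    """Return the sorted names of every group matched by a record's keywords."""
    present = {k.strip().lower() for k in keywords}
    return sorted(
        group
        for group, combos in KEYWORD_GROUPS.items()
        if any(combo <= present for combo in combos)
    )


print(groups_for(["Receptor", "Transmembrane", "Glycoprotein"]))
# prints ['glycoprotein', 'secretory or membrane']
```

A record can match several groups at once, which mirrors the overlap between groups noted in paragraph [0164].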

EXAMPLE 10 


Selection of clones by characterization. 

[0165] From the data obtained by the above characterization, clones encoding a novel secretory protein or membrane protein, or proteins with other predicted functions, were selected by the combination of the ATGpr1 score and the prediction of the signal sequence by PSORT, or according to the top hit data in the homology search of SwissProt.
[0166] In selecting the clones, 5'-end sequences identical to a human mRNA were ignored, whereas those identical to a human mRNA in part but obviously not identical in the other part were included, because some of the selected clones were of the latter kind.
[0167] Also, if the finally selected clones were found to be not full-length in comparison with the sequences of human mRNAs and ESTs, these clones were discarded.

EXAMPLE 11 

A method for selection of clones by the combination of the ATGpr1 score and the prediction of the signal sequence by the PSORT (a method for selection of secretory proteins and membrane proteins that are novel and full-length).

[0168] The sequences of the clones and the representative sequences of their clusters were used to obtain the maximal ATGpr1 score and to predict the presence of a signal sequence. First, clones were selected based on the representative sequences of the clusters. The correspondence between the names and SEQ IDs of the representative sequences used for selection (Table 368), and the correspondence between the names and SEQ IDs of the introns (including the representative sequences of the 5'-end and 3'-end, and ESTs) used for selecting clones from the representative sequences of the groups (Table 369), are shown in the last part of the present specification. Therein, HRIFA and HRIRA indicate the representative sequence of the 5'-end group and that of the 3'-end group, respectively.
[0169] In clusters containing a single clone (the sequence of the 5'-end clone = the representative sequence of the 5'-end), the clones selected were, in principle, those judged to be full-length in comparison with human mRNA and ESTs, having a maximal ATGpr1 score of 0.5 or higher, and predicted to contain the signal sequence. However, a clone having a longer 5'-end was selected in the following cases: the maximal ATGpr1 score was less than 0.5; the sequence of the 5'-end was not full-length; the clone was obviously shorter although it was not classified into the same cluster according to the BLAST search of the other clones; or the 5'-end sequence corresponding to the 3'-end of the other clones in the same cluster in which the 3'-end sequence of the clone was contained was found to be longer by assembling. Furthermore, if there were multiple full-length clones in the same cluster and assembling could not determine which had the longer 5'-end, all the clones were selected. For assembling, Sequencher™ (Hitachi Soft Engineering) was used. As a result, the signal sequence predicted to be present in the representative sequence was not present in some of the selected clones. In some cases, the ATGpr1 score became smaller than 0.5 or 0.3. The fullness ratio in these clones was low, yet it is still possible that the clones are full-length. The clones in which the signal sequence predicted to be present in the representative sequence was not present after selection were likely to lack the signal sequence, but it is still possible that they encode a membrane protein.
[0170] In clusters comprising multiple clones, in which the representative sequence of the 5'-end was predicted to contain the signal sequence, the clones selected were those having the longest 5'-end sequence among the clones judged to be full-length in comparison with human mRNA and ESTs, having a maximal ATGpr1 score for the 5'-end sequence of 0.5 or higher, and predicted to contain the signal sequence. However, a clone having a longer 5'-end was selected in the following cases: the maximal ATGpr1 score was less than 0.5; the sequence of the 5'-end was not full-length; the clone was obviously shorter although it was not classified into the same cluster according to the BLAST search of the other clones; or the 5'-end sequence corresponding to the 3'-end of the other clones in the same cluster in which the 3'-end sequence of the clone was contained was found to be longer. Furthermore, if there were multiple full-length clones in the same cluster and assembling could not determine which had the longer 5'-end, all the clones were selected. As a result, the signal sequence predicted to be present in the representative sequence was not present in some of the selected clones. In some cases, the ATGpr1 score became smaller than 0.5 or 0.3. The fullness ratio in these clones was low, yet it is still possible that the clones are full-length. The clones in which the signal sequence predicted to be present in the representative sequence was not present after selection were likely to lack the signal sequence at the 5'-end, but it is still possible that they encode a membrane protein.
[0171] Next, in clusters comprising multiple clones, in which the representative sequence of the 5'-end was predicted to have no signal sequence, the clones selected were those judged to be full-length in comparison with human mRNA and ESTs, having a maximal ATGpr1 score for the 5'-end sequence of 0.5 or higher, and predicted to contain the signal sequence.

[0172] The number of clones selected by the combination of the ATGpr1 score and the prediction of a signal sequence by PSORT was 254. The number of clones having a maximal ATGpr1 score of 0.5 or higher and predicted to contain a signal sequence was 170 (Tables 7-10). Among these, 164 clones were found to have a representative sequence of the original cluster that fulfills the same conditions. On the other hand, 5 clones were selected from representative sequences of the 5'-end of clusters that were predicted to contain a signal sequence while the maximal ATGpr1 score was lower than 0.5. One clone was selected from a representative sequence of the 5'-end of a cluster that was predicted to have no signal sequence.

[0173] The clones having a maximal ATGpr1 score of 0.3 or higher and less than 0.5 and predicted to contain the signal sequence numbered 35 (Table 11), of which 8 clones were found to have a representative sequence of the original cluster that fulfills the same conditions. Twenty-seven clones were selected from representative sequences of clusters that had a maximal ATGpr1 score of 0.5 or higher and were predicted to have no signal sequence.
[0174] The clones having a maximal ATGpr1 score of less than 0.3 and predicted to contain a signal sequence numbered 41 (Table 12). The clones having a maximal ATGpr1 score of 0.5 or higher and predicted to have no signal sequence numbered 4 (Table 13). The clones having a maximal ATGpr1 score of 0.3 or higher and less than 0.5 and predicted to have no signal sequence numbered 2 (Table 14). The clones having a maximal ATGpr1 score of less than 0.3 and predicted to have no signal sequence numbered 2 (Table 15). The representative sequences of the original clusters of all these clones had a maximal ATGpr1 score of 0.3 or higher and were predicted to contain a signal sequence.

[0175] The fullness ratio of the clones having a maximal ATGpr1 score of 0.5 or higher, 0.3 or higher, and 0 or higher is expected to be as shown in Tables 3, 4, 5, and 6.
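The counts in this example partition the 254 selected clones by score band and by the PSORT signal prediction. The sketch below bins a clone and checks that the six reported counts add up to the stated total. The band labels and the function name `selection_bin` are illustrative assumptions, and the last bin follows the caption of Table 15 (score below 0.3, no signal sequence).

```python
def selection_bin(max_score: float, has_signal: bool):
    """Assign a clone to a (score band, signal prediction) bin."""
    if max_score >= 0.5:
        band = ">=0.5"
    elif max_score >= 0.3:
        band = "0.3-0.5"
    else:
        band = "<0.3"
    return band, has_signal


# Clone counts per bin as reported in the text (Tables 7-10, 11, 12, 13, 14, 15).
REPORTED = {
    (">=0.5", True): 170,
    ("0.3-0.5", True): 35,
    ("<0.3", True): 41,
    (">=0.5", False): 4,
    ("0.3-0.5", False): 2,
    ("<0.3", False): 2,
}

# The six bins account for all 254 selected clones.
assert sum(REPORTED.values()) == 254

# Example rows from Tables 13, 14 and 15 (clone score, clone signal prediction).
print(selection_bin(0.81, False))  # NT2RP3002281
print(selection_bin(0.47, False))  # NT2RP3000481
print(selection_bin(0.12, False))  # PLACE1005601
```

The same check applies within the first bin, where 164 + 5 + 1 = 170, and within the second, where 8 + 27 = 35.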



Table 7 
Table 13

Four clones from which selected clones have the maximal ATGpr1 score 0.5 or higher, and predicted to be lacking the signal sequence by the PSORT

name of clone   name of sequence   maximal        signal   name of representative   maximal        signal
                                   ATGpr1 score            sequence                 ATGpr1 score
NT2RP3002281    F-NT2RP3002281     0.81           No       HRIFA012999a             0.61           Yes
NT2RP3002721    F-NT2RP3002721     0.94           No       HRIFA023305a             0.57           Yes
NT2RP3004083    F-NT2RP3004083     0.94           No       HRIFA008387a             0.76           Yes
PLACE1005669    F-PLACE1005669     0.94           No       HRIFA012513a             0.65           Yes


Table 14

Two clones from which selected clones have the maximal ATGpr1 score 0.3 or higher and less than 0.5 and predicted to have no signal sequence by the PSORT

name of clone   name of sequence   maximal        signal   name of representative   maximal        signal
                                   ATGpr1 score            sequence                 ATGpr1 score
NT2RP3000481    F-NT2RP3000481     0.47           No       HRIFA028614a             0.93           Yes
NT2RP3003559    F-NT2RP3003559     0.48           No       HRIFA025514a             0.45           Yes



Table 15

Two clones from which selected clones have the maximal ATGpr1 score 0 or higher and less than 0.3, and predicted to have no signal sequence by the PSORT

name of clone   name of sequence   maximal        signal   name of representative   maximal        signal
                                   ATGpr1 score            sequence                 ATGpr1 score
PLACE1005601    F-PLACE1005601     0.12           No       HRIFA010593a             0.64           Yes
PLACE1006786    F-PLACE1006786     0.22           No       HRIFA012333a             0.51           Yes



EXAMPLE 12 

A method for the selection of clones based on the top hit data in the homology search against the SwissProt (a method 
for the selection of a novel full-length protein that is predicted to have a function based on the top hit data). 

[0176] The representative sequences of the clusters were discarded when the 5'-end sequence was identical (90% or more matching) or not similar (the compared part is a sequence of 25 nucleotides or shorter and the similarity is lower than 60%) to the top hit data in SwissProt. The remaining representative sequences, whose 5'-ends were similar to the top hit data, were then classified by the groups of keywords above (some representative sequences belong to more than one group because of multiple keywords), and clones were selected from the clusters. The names and the corresponding SEQ IDs of the representative sequences, and the names and corresponding SEQ IDs of the introns (including the representative sequence of the 5'-end or the 3'-end, or ESTs) used for selecting the clones from the representative sequences, are shown in the last part of the present specification (Tables 368 and 369, respectively). HRIFA indicates the representative sequence of the 5'-end group, and HRIRA indicates the representative sequence of the 3'-end group.

[0177] In principle, from clusters containing only a single clone (the 5'-end sequence is the representative sequence of the cluster), that clone was selected. However, a clone containing a longer 5'-end was selected in the following cases: the maximal ATGpr1 score was less than 0.5; the 5'-end sequence of the clone to be selected was not complete; the 5'-end of the clone was found to be obviously short even though the clone was not included in the same cluster based on the BLAST analysis against the other clones; or the 5'-end sequence corresponding to the 3'-ends of the other clones belonging to the same cluster in which the 3'-end of the clone was included turned out to be longer on assembling them. When there were two clones in the same cluster that were judged to be full-length, and it was difficult to determine which clone had the longer 5'-end even by assembling them, all the clones were selected. As a result, the ATGpr1 score in some clones became less than 0.5 or less than 0.3. The fullness ratio of these clones was lower, but there is still a possibility that the clones are full-length.

[0178] Where multiple clones were contained in a cluster, the clone selected was the one having the longest 5'-end among the clones judged to be full-length in comparison with human mRNA or human ESTs. However, a clone containing a longer 5'-end was selected in the following cases: the maximal ATGpr1 score was less than 0.5; the 5'-end sequence of the clone to be selected was not complete; the 5'-end of the clone was found to be obviously short even though the clone was not included in the same cluster based on the BLAST analysis against the other clones; or the 5'-end sequence corresponding to the 3'-ends of the other clones belonging to the same cluster in which the 3'-end of the clone was included turned out to be longer on assembling them. When there were two clones in the same cluster that were judged to be full-length, and it was difficult to determine which clone had the longer 5'-end even by assembling them, all the clones were selected. As a result, the ATGpr1 score in some clones became less than 0.5 or less than 0.3. These clones can still be full-length.
[0179] Based on the top hit data in the SwissProt homology search, 658 clones were selected. Among them, 446 clones were selected by the keywords for secretion or membrane, 243 by the keyword glycoprotein, 51 by the keywords for signal transduction, 130 by the keywords for transcription, and 17 by the keywords for disease.

[0180] Among the 446 clones selected by the keywords for secretion or membrane, 77 clones overlapped with those selected by combining the ATGpr1 score and the PSORT prediction of a signal sequence. Many clones also overlapped with those selected by the keyword glycoprotein, and some overlapped with the clones selected by the keywords for diseases.

[0181] Among the 243 clones selected by the keyword glycoprotein, 53 clones overlapped with those selected by combining the ATGpr1 score and the PSORT prediction of a signal sequence. Many clones also overlapped with those selected by the keywords for secretion or membrane, and some overlapped with the clones selected by the keywords for diseases.

[0182] Among the clones selected by the top hit data in the homology search of SwissProt, 532 clones had a maximal ATGpr1 score of 0.5 or higher, 59 clones had a maximal score of 0.3 or higher and less than 0.5, and 67 clones had a maximal score of less than 0.3.

[0183] When the maximal ATGpr1 score is 0.5 or higher, 0.3 or higher, or 0 or higher, the expected fullness ratio is as shown in Tables 3, 4, 5, and 6.
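The counts reported in this example can be checked with a few lines of arithmetic. The sketch below confirms that the three score bands account for all 658 clones and quantifies how far the per-keyword counts exceed that total; the dictionary names are illustrative assumptions.

```python
TOTAL_SELECTED = 658

# Clones by maximal ATGpr1 score band, from paragraph [0182].
by_score = {">=0.5": 532, "0.3-0.5": 59, "<0.3": 67}

# Clones by keyword group, from paragraph [0179].
by_keyword = {
    "secretion or membrane": 446,
    "glycoprotein": 243,
    "signal transduction": 51,
    "transcription": 130,
    "disease": 17,
}

# The score bands are mutually exclusive, so they sum to the total.
assert sum(by_score.values()) == TOTAL_SELECTED

# The keyword groups are not exclusive: their sum exceeds the total by the
# number of additional group memberships held by multiply classified clones.
extra_memberships = sum(by_keyword.values()) - TOTAL_SELECTED
print(extra_memberships)  # prints 229
```

The surplus of 229 memberships is consistent with the overlaps between keyword groups described in paragraphs [0180] and [0181].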

Table 16

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "growth factor", "cytokine", or "hormone", and the selected clones.

name of clone   name of representative sequence
HEMBA1001563    HRIFA001439a
HEMBA1003047    HRIFA002743a
HEMBA1005070    HRIFA020144a
HEMBA1006724    HRIFA021620a
HEMBA1006916    HRIFA021855a
MAMMA1001066    HRIFA027355a
MAMMA1001634    HRIFA027187a
MAMMA1002165    HRIFA027673a
NT2RM4000326    HRIFA032530a
NT2RM4001377    HRIFA005300a
NT2RP2000447    HRIFA006448a
NT2RP2000663    HRIFA006609a
NT2RP2000903    HRIFA006798a
NT2RP2002974    HRIFA027860a
NT2RP2003369    HRIFA008596a
NT2RP2004141    HRIFA009123a
Table 19 (continued)

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "transmembrane", and the selected clones (except overlapped with Table 16, 17, and 18)


name of clone


name of representative sequence

"tljvd/"^ aaa>i aa 
1 nYHL/lUUU4UU 


nrtlrAUUU0b4a 


TUX/D/™^ AAACO A 

I HYHU iUUUt>o4 


UDiCAnooonQi 

nnirAU^y^uya 


"TLIX/D/"^ A A A£"7£» 


LJDIETA AOdOCCo 

rlHIrAO^y^oba 


1 HYRO1000776 


HrllrAU^yol /a 


THYR01 000795 


nHlrA02y*3<i7a 


-r"i_j \y r"}/""\ A A A A OC C 

1 HYROIOOOobo 


UDICA AT7~7 H >1 « 


THYRO1001113 


MHIrA0^y4b0a 


THYRO1001128 


nnlrA0£y4b7a 


THYRO1001242 


HRIFA032360a 


THYRO1001266 


HRIrA030264a 


THYRO1001456 


HRIFA030370a 


THYRO1001529 


HRIFA030411a 


THYRO1001702 


HRIFA030511a 


Y79AA1000127 


HRIFA026121a 


Y79AA1 000270 


HRIFA005644a 


Y79AA1001426 


HRIFA028651a 


Y79AA1001787 


HRIFA028790a 


Y79AA1001799 


HRIFA032070a 


Y79AA1002213 


HRIFA032224a 



Table 20

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "signal", and the selected clones (except overlapped with Table 16, 17, 18 and 19)

name of clone   name of representative sequence
BNGH41000087    HRIFA029508a
HEMBA1000128    HRIFA000123a
HEMBA1000443    HRIFA000415a
HEMBA1000590    HRIFA000553a
HEMBA1000634    HRIFA004780a
HEMBA1000745    HRIFA000695a
HEMBA1001221    HRIFA001132a
HEMBA1001228    HRIFA001138a
HEMBA1001390    HRIFA000071a
HEMBA1002131    HRIFA001942a
HEMBA1002167    HRIFA001975a
HEMBA1002178    HRIFA001984a
HEMBA1002524    HRIFA002284a
HEMBA1002992    HRIFA002694a
HEMBA1003072    HRIFA002762a
HEMBA1003315    HRIFA000016a
HEMBA1003487    HRIFA003055a
HEMBA1003530    HRIFA003093a
HEMBA1005145    HRIFA003946a
HEMBA1005337    HRIFA019651a
HEMBA1005449    HRIFA020707a
HEMBA1005522    HRIFA021398a
Table 20 (continued)

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "signal", and the selected clones (except overlapped with Table 16, 17, 18 and 19)


name of clone


name of representative sequence


l 1 1~ ft. < n a a AACOOC 

HtMbAl OUbodb 


HrllrAUl £(Jbya 


HbMbAl UUbo/d 


UIDICAAOH CAIa 

nnlrAU^lo4oa 


HbMbAl 00b7U7 


UDICArtO ■( >l A An 

M H 1 rAU^ 1 4o » 3 


1 1 r~ A < P A A A AC "7 ACi 

HEMBA1 006749 


HnlrAU^loo/a 


UmjID A H A AC A AO 

HEMBAI 006902 


MnlrAU^l /043 


HcMBAl 00701 3 


UDICAAO 4 Q/\Cn 

MHIrAO^iyUba 


HEMBA1 007057 


l_l D 1 C A AO On Q C i-» 

n H 1 rAU^^y oba 


AAA >l A ~7 

HEMBB1 000447 


MnlrAOOloooa 


HEMBB1 000567 


HRIFA02 9730a 


i i t~ a Ann a nnnaa a 

HEMBB1000881 


HRIrA02 9932a 


HEMBB1001026 


HRIFA030025a 


LirUDDH A AH f\AO 

HEMBB1 001048 


UDICA AO A A AC n. 

HHIrA03004oa 


HEMBB1001847 


HRIPA03 1249a 


ft A A ft Aft 1 A j /*\/* 

MAM MA 1000 106 


HRIFA024482a 


MAMMA1 000226 


HRIFA025978a 


MAMMA1 000591 


unirA ao co ao #i 

HRIFA026303a 


ft J Aft 4 ft J A 4 /\f\A A J A 

MAMMA1001043 


HRIFA026764a 


ft J Aft J ft J A 4 

MAMMA1001957 


ynirA AO"7COOm 

HRIFA027536a 


MAMMA1 002080 


HRIFA01 6963a 


ft A Aft a ft j a 4 a 

MAMMA1 002234 


HRIFA027722a 


MAMMA1 002633 


1 1 ^% 1 ^ A AAA A /** ^ M 

HRIFA030461 a 


ft A A ft J ft A A 4 

MAMMA1003126 


□ nirA AAAA Af). 

HRIFA029263a 


NT2RM 1000462 


HRIFA004426a 


NT2RM1 000542 


HRIFA004490a 


NT2RM2000410 


HRIFA022055a 


It t*m 1^ ft JA AA A i AH 

NT2RM2000423 


unirA aaaa c 

HRIFA022065a 


NT2RM2000622 


1 mi PA AAA4 O 

HRIFA022156a 


NT2RM2000773 


1 1 l~i 1 1~ A AOOO A>1 <-> 

HRIrA023o94a 


NT2RM2001626 


HRIFA028911a 


ft. I^P A f~\ ft ftf^ S\ S\ 4 04 A 

NT2RM2001818 


HRIFA031935a 


ft. i^n a ft a A A *"l 

NT2RM4000648 


i i niPA /\oo^oo*» 

HRIFA032730a 


X ITA l"1 ft IJAAJ A J A 

NT2RM4001843 


UDICA AO A ~T A O n 

HRIFA024718a 


MTODDH AAAACA 

NT2RP1 000050 


MnlrAUOOlU^a 


K ITO O D H A A -4 A A 

NT2RP1001004 


n n 1 rAUUb ( d\)di 


K ITOD DO A A AO A A 

IMT2RP2Q00394 


UDICA AAOCX An 

n n 1 rAUUob4Ua 


MTODDOAAACH A 

NT2HP2000514 


n n 1 rA0Ub4y4a 


mtoddoaah /ton 

NT2RP200148Q 


UDICA AA"70 H An 

MnlrAOU/iiiya 


MTODDOAA1 7CC 


|_| D I C A A CilA OA ?a 

n n 1 rAUU / 4^:43 


NT2RP2001878 


HRIFA007512a 


NT2RP2002188 


HRIFA007745a 


NT2RP2002564 


HRIFA007244a 


NT2RP2002824 


HRIFA008200a 


NT2RP2003042 


HRIFA008362a 


NT2RP2003593 


HRIFA008252a 


NT2RP2003931 


HRIFA008976a 


NT2RP2004606 


HRIFA009451a 


NT2RP2004648 


HRIFA009482a 
Table 20 (continued)

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "signal", and the selected clones (except overlapped with Table 16, 17, 18 and 19)


name of clone


name of representative sequence


N I 2Hr2UU5l bo 


nnlrAUU9o2oa 


MTOD DO Art CO A T 

NT2RP2Q05247 


LJDICA AAAQQ1 fs 

nnlrAUUyool a 


N 1 2Hr 2005378 


HnlrAUU4yiya 


n ito r*» piaaac c a a 

NT2RP2005541 


UDICAftH AAA A n 

HnlrAOl 0090a 


t> it a r*»r*»AAAfroAA 

NT2RP2005883 


UDtTA A4 AO 

HRIFA01 0319a 


k ito r*t n a a a a a An 

NT2RP2006042 


HnlrAOl 0425a 


NT2RP3000063 


HRIFA02252oa 


NT2RP3000436 


HRIFA02277oa 


A, I^Trt^^ A AAA AAA 

NT2RP3000444 


HRIFA022782a 


■k t~rg*\ ^\a a a a a t*\ 4 

NT2RP3000481 


1 1 PI 1 A A A A A -4 j« — 

HRIFA028614a 


k i^prt ^\ nA a aa^a a 

NT2RP3000721 


1 1 r*> 1 ^ A A A A n /"> C" «. 

HRIFA009825a 


k i^r* a r\ f""i a a a a a a 

NT2RP3001012 


1 1 r*% 1 r" A AAA -4 A A ^ 

HRIFA023129a 


NT2RP3001159 


i ir^ i r" a An Art -4 a _ 

HRIFA023212a 


NT2RP3001592 


1 1 ^\ 1 ^ A AAA A f+ A — 

HRIFA023464a 


NT2RP3001754 


| 1 f—\ 1 |— AAA A A _ 

HRIFA007728a 


NT2RP3002311 


HRIFA024718a 


NT2RP3002738 


HRIFA020748a 


ib i ^\ A aa^a A 

NT2RP3002790 


1 1 1 I - A A A AC* 4 A — 

HRIFA026519a 


NT2RP3002887 


HRIFA029278a 


NT2RP3003354 


1 1 1~\ IT A A A A A «-l A 

HRIFA008212a 


NT2RP3003448 


HRIFA025479a 


NT2RP3003473 


1 imr*AAA4 A A A 

HRIFA001413a 


NT2RP3003614 


HRIFA032642a 


NT2RP3004075 


1 1 1 r* A A A A A A A 

HRIFA010301a 


NT2RP3004090 


HRIFA027329a 


NT2RP3004202 


• i ^> | *^ A A A r~ A A^T _ 

HRIFA025327a 


NT2RP3004309 


HRIFA025778a 


NT2RP3004345 


i i r™> i ^ a aa^a^a— . 

HRIFA02 5353a 


NT2RP3004557 


| 1 A| ^ A AA^AA^* 

HRIFA025907a 


NT2RP4001467 


1 1 1 ^ A A A A A*"*A 

HRIFA0 13276a 


OVARC 10003 13 


HRIFA011016a 


t A f*^ S*\ A AAA J 4 A 

OVARC 1000410 


1 11^ 1 t*" A AA A A A A _ 

HRIFA022691a 


t A ^\ 4 AAA iAA 

OVARC 1000439 


1 1 #~> 1 f™ A A-4 -4 -i AC -» 

HRIFA011105a 


/"\\ » a r-\ j a a j a a a 

OVARC1001086 


LimrAAH A C OA n 

HRIFA01 1580a 


OVARC1QQ1569 


unirA aoo"70o« 
HHIrA02272oa 


OVARC1001570 


l_ir->ir~AA4 A/1 A A n 

HHIhAU 194 123 


HI A f - -4 A A H /4 A~7 

PLACE1001407 


UDITAA't OACAa 

nnlrAOi 20593 


PLACE1001464 


1 1 ft 1 r~ A A -4 Q07Cn 

MHIrAU 1o2 /oa 


PLACE1001516 


HRIFA012702a 


PLACE1001795 


HRIFA012885a 


PLACE1001918 


HRIFA012969a 


PLACE1 002080 


HRIFA0130921a 


PLACE1002153 


HRIFA013135a 


PLACE1 002355 


HRIFA013254a 


PLACE1 002374 


HRIFA013265a 


PLACE 1002726 


HRIFA01 8688a 


PLACE1 003428 


HRIFA013911a 
Table 20 (continued)

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "signal", and the selected clones (except overlapped with Table 16, 17, 18 and 19)


name of clone


name of representative sequence


rLAL» b 1 UUo4bU 


UR1PAH1 IQlOa 

nnirwu i oyota 


rLAUb lUUof I c 


nnl rnu 1 H 1 OOa 


□ 1 AOC^ f\(\A A*7Q 

r LAO b 1 UU4U / o 


n n I rnU 1 *fOODa 


rLAUb lUU45^U 


nnlrnU l*tot I a 


PLAObl 004648 


nmrAUi4/u^a 


DI APCH AA/1 007 

rLAOblUU4oo/ 


nnirrtu i *+oooa 


rLAOb 1 00542b 


nnirAU i o^'i-oa 


DI A /"> P'l AACA7H 

rLACb 100b071 


nnirMU i bboya 


PLACE 1006443 


HHIrAUl oyu^a 


PLACE1006716 


nnl r au i bu / ua 


PLACE1 006959 


nnlrAUl od 143 


DI A^CH AA7A77 

PLACE1 007077 


nnlrAUl bboya 


PLACE 1007081 


HHIrAUl b^yua 


PLACE 1007702 


nnlrAU tbbbya 


PLACE1 008657 


nnlrAUl 7<d5/a 


PLACE 1008744 


HHIrAUl /ol^a 


PLACE1 009546 


nnlrAUl /oUia 


PLACE1011116 


UDICAAI Q~7EZAn 

HHIrAUl o754a 


PLACE1011708 


HHIrAUl yiUba 


rti a Ar*^A/\A44n 

PLACE2000118 


H H 1 rAU^4y y4a 


PLACE3000213 


HHIrAUl 54oba 


PLACE4000354 


UDICAAI r/|OC n 

HHIrAU l54oba 


THYR01 000061 


UDICAAI 007A /-» 

HHIrAU io^/^ya 


THYROI 000846 


HHIrAU^yd4ya 


THYRO1001063 


HRIFA029434a 


THYRO1001608 


HRIFA050456a 


THYRO1001803 


HRIFA030566a 


Y79AA1 000876 


HRIFA030629a 


Y79AA1001090 


HRIFA028511a 


Y79AA1001272 


HRIFA028576a 


Y79AA1001727 


HRIFA006642a 


Y79AA1001803 


HRIFA032073a 


Y79AA1 002376 


HRIFA032820a 



Table 21

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "glycoprotein", and the selected clones

name of clone   name of representative sequence
BNGH41000087    HRIFA029508a
BNGH41000091    HRIFA029511a
HEMBA1000275    HRIFA000264a
HEMBA1000349    HRIFA000327a
HEMBA1000590    HRIFA000553a
HEMBA1000634    HRIFA004780a
HEMBA1000835    HRIFA000776a
HEMBA1000907    HRIFA000845a
Table 21 (continued)

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "glycoprotein", and the selected clones


name of clone


name of representative sequence


HEMBA1001221 


HRIFA001132a 


HEMBA1001228 


HRIFA001138a 


HEMBA1001621 


HRIFA001489a 


HEMBA1 002131 

■ 1 ^— 1 V 1 LJr^ 1 \J\J ^ 1 \J 1 


HRIFA001942a 

1 II 111 #V W W 1 ~ t. c* 


HFMBA10021R4 


HRIFA001972a 

1 II 111 r\\J w 1 ^ / bQ 


HEMBA1002167 


HRIFA001975a 


HEMBA1002178 


HRIFA001984a 


HEMBA1002316 


HRIFA002102a 

1 II 111 W W b f UbU 


HEMBA1 002421 


HRIFA005392a 

1 II III r\W\S\J&^Q 


HFMRA10027R7 


HRIFA002503a 

1 Mill nuk/LJUOCl 


HFMRA1 003047 


HRIFA002743a 


HFMRA1003101 

rlCIVIDM 1 \J\J\J 1 U 1 


HRIFA002787a 


HFMBA1 003230 


HRIFA002891a 

i ii hi f\\j\j c~KJ\s i a 


HFMRA10033Q? 


HRIFA002980a 

i 1 1 1 1 1 nuUb^uv/ci 


HFMRA1 004P50 


HRIFA003504a 


HFMRA1 004/301 


H R 1 FA0P0RQ3a 


HFMRA 1004444 


HRIFA02Q285a 

I Mill nV/bUfaUt/U 


HFMRA1 004454 

n CI VI DM 1 


HRIFA0035QPa 


HFMRA1 004505 


HRIFA003R35a 


HFMRA100544Q 


HRIFA0?0707a 


MPMRA1 005dflQ 
n ci vi dm i uuvjtoy 


HRIFA0P4543a 


HFMRA10055P? 


HRIFA02 1398a 

1 Mill r\\JC- 1 J^UQ 


HFMRA1 005545 


HRIFA0?0?7?a 


HFMRA1 00R335 


HRIFA0t20fiQa 


HFMRA100R57? 


HRIFAOP 1543a 
I i n i rnwL i jtua 


HFMRA1 00R707 

n CI VI DM 1 UUD f U f 


HRIFA0?14QQa 


HFMRA100R794 

n CI VI DM 1 UUD f tt 


HRIFA091 RPOa 


HFMRA100R74Q 


HRIFA021R37a 

1 Mill fWJ ^ 1 UJ r Q 


HFMRA100RQ02 


HRIFA021754a 

i i n 1 1 nut. i i 


HFMRA1 007057 


HRIFA022985a 


HFMBA1 007332 


HRIFA022493a 

i Mill nubb^sua 


HFMRB1 000447 

1 1 1 1 VI l_J l_> 1 uuutt / 


HRIFA001 558a 

i ii hi nuy i \j a 


HFMBB10005R7 


HRIFA029730a 

1 II 111 t\\J Lm, C / UVQ 


HEMBB1 000679 


HRIFA029802a 

i ii hi nv/L^uwbu 


HFMBB1000R81 

1 1 1— 1 V 1 l_J l_J 1 UvUUU 1 


HRIFA029932a 

1 II ill f\\J^ w w^ VtwU 


HEMBB1001048 

1 1 ^1 VI LJ LJ 1 w w • W t ^ 


HRIFA030045a 

i ii hi w 7 ^ yj w^wc* 


H EM BB 1002427 


HRIFA005760a 


MAMMA1000106     HRIFA024482a
MAMMA1000403     HRIFA026465a
MAMMA1000591     HRIFA026303a
MAMMA1000681     HRIFA026364a
MAMMA1000706     HRIFA026382a
MAMMA1001043     HRIFA026764a
MAMMA1001237     HRIFA026860a
MAMMA1001615     HRIFA022865a
MAMMA1001893     HRIFA027485a
MAMMA1001978     HRIFA027549a
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the ATGpr1 of 0.5 or higher.

name of clone    name of sequence    maximal ATGpr1 score
HEMBA1005522     F-HEMBA1005522      0.73
HEMBA1005545     F-HEMBA1005545      0.94
HEMBA1005698     F-HEMBA1005698      0.68
HEMBA1005929     F-HEMBA1005929      0.72
HEMBA1005945     F-HEMBA1005945      0.80
HEMBA1006276     F-HEMBA1006276      0.50
HEMBA1006299     F-HEMBA1006299      0.94


HFMRA1 006335 


F-HFMRA1 006135 


0.94 


HFMRA1 006430 


F-HFMBA1 006430 

1 1 ILIVIUniUuUTOU 


0.93 


HFMRA1 00R4R9 


F-HFMRA1 0064R2 

% nCIVIDn 1 uuotoc 


0.59 


HFMRA1 00R517 

n L_ IVI DM 1 UUDJ 1 / 


F-HFMRA1006517 
r ntiviDn i uudj i / 


0.68 


HPMRA1 00R544 


F-HFMRA1 006544 
r ncivi dm i uuuu*t*t 


0 94 


HFMRA1 00R579 

il dVIDM I UUUU 1 £ 


F-HFMRA1 006572 
i ntiviDn luuujf c. 


0.62 


HFMRA1 00R707 

n C IVI DM I UUD/ U f 


F-HFMRA1 006707 

I n C IVI DM 1 UUO / U / 


0 94 


HPMRA1 00R794 
11 CI VI DM I UUD / t*r 


F-HFMRA 1006794 

1 TILlVIDn 1 UUU f t*r 


0.80 


ri tfvtDM l UUO / 


P_MPIUlRA1O0fi74Q 
r-nCIVIDM I UUO /Hi? 


0 Q4 


HFMRA1 00RQ09 

II L_IVI DM I UUOUUt 


F-HFMRA1 006909 
i ntiviDn i uuusut 


0.94 


OORQ1 fi 
rlCIVIDM I UUOcf I D 


F-HFMRA1 006Q1 R 
r riLivi dm i uuDa i u 


0 RO 
u.ou 


MPMRA1 00701 3 
n El IVI DM I UU / u i o 


F-HFMRA1 00701 3 

1 n d IVI DM 1 UU/ U 1 o 


0.82 


n Civl DM I UU / uo / 


F-HFMRA1 007057 
r riLivi dm i uu / uj / 


0 94 


nClVlDM I UU / C.C.KS 


P.MPMRA1 00799R 
i ll CI VI DM I UU/ LCU 


0 50 

U.JU 


MPMRA1 007941 
rlCIVIDM I UU / 1*+ 1 


F-HFMRA1 007941 

1 (ILlVl DM I UU / t*+ 1 


0 94 


MPMRR1 0001 OR 
nCIVIDD I UUU I.Uu 


F-HFMRR1 0001 OR 

1 11 dvl DD 1 UUU 1 UU 


0.94 


MPMRR1 OOOA47 
nCIVIDD I uuu^f**/ 


F-HFMRR1 000447 

r nCIVIDD 1 UUUH-H/ 


0 73 
u. / o 


MPMRR1 OOORRR 
nCIVIDD I uuuooo 


F-HFMRR1 OOORRR 
1 TILlVIDD 1 uuuuuo 


0.50 


MPMIRR1 OOOR7Q 

nciviDD i uuuo / y 


P-MPMRR1 000R7Q 
r nLiviDD i uuuo/ y 


0 91 


nciviDD 1 UUUOO 1 


F-HFMRR1000RR1 
r nciviDD t uuuoo i 


0 77 
u. / / 


HFMRR1001 09R 

nCIVIDD i UU I U£U 


F-HFMRR1 001 096 
r*riLiviDD i uu i u£u 


0.94 


MPMRR1 001 04R 

IILllVIDD 1 UU 1 UH-O 


F-HFMRR100104R 
i n U.IVI dd i uu i u*to 


0 RR 
u.ou 


HFMRR1001 POO 

nCIVIDD 1 UU 1 c.\J\J 


F-HFMRR1001900 
r iilividd iuu i tuu 


0.81 


HFMRR1 001 573 
nuviDu i uu i j/o 


F-HFMRR1001 571 

1 11 t IVI DD IUU I \J f o 


0 RO 
u.uu 


HFMRR1 001 R47 

II L-IVI D D 1 UU 1 OH / 


F-HFMRR1001 R47 
r nLiviDu iuu i o*t / 


0.81 


HFMRR1001959 

nCIVIDD 1 UU 1 \J\J\J 


F-HFMBB1 001 959 
r rmiviDu < uu i sjs 


0.94 


HFMRR1 002041 


F-HFMRR1 002041 

i IILIVIUD 1 UUCU^ 1 


0.79 


HFMRR1009051 

IILIVIDD lUUtUJ 1 


F-HFMBB1 002051 

I 1 IL.IVIUU lUULUJ 1 


0.60 


HFMRR1 002102 

nCIVIDD 1 UUcuUt 


F-HFMRR1 002302 
i n^iviDw luucoUb 


0.89 


HEMBB1002427     F-HEMBB1002427      0.94
HEMBB1002661     F-HEMBB1002661      0.94
MAMMA1000106     F-MAMMA1000106      0.78
MAMMA1000204     F-MAMMA1000204      0.94
MAMMA1000226     F-MAMMA1000226      0.94
MAMMA1000403     F-MAMMA1000403      0.59
MAMMA1000473     F-MAMMA1000473      0.86
MAMMA1000496     F-MAMMA1000496      0.70
MAMMA1000591     F-MAMMA1000591      0.77
MAMMA1000681     F-MAMMA1000681      0.94
MAMMA1000788     F-MAMMA1000788      0.83
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EXAMPLE 13 

Selection of cDNA clone NT2RP2006580

[0184] Clone NT2RP2006580, as well as clone HEMBA1000121, was selected from the representative sequences
belonging to the HRIFA000116a cluster of the most homologous sequence in the SwissProt with the keyword
"transmembrane". Although each of the clones, HEMBA1000121 and NT2RP2006580, was assembled with other clones for
5' extension, no other clone extended them toward the 5' direction. Accordingly, it is possible that both
clones are full-length cDNA clones. The maximal ATGpr1 score of F-NT2RP2006580 is 0.37, and therefore the fullness
ratio is low. However, it is still possible for the sequence to cover the full length.

[0185] Thus, the total number of selected clones is 830. Based on the top matching data resulting from the Swiss-Prot
homology search, 659 clones were selected. From them, 447 clones were selected by the keywords "secretion" and
"membrane". Among the clones selected based on the top matching data, 60 clones exhibited a maximal ATGpr1
score of 0.3 or higher and less than 0.5.
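
The score ranges mentioned above can be illustrated with a minimal sketch. The function name and the bin labels are hypothetical and are introduced only for illustration; the only values taken from the text are the boundaries 0.3 and 0.5 and the example score 0.37.

```python
def atgpr1_score_bin(max_score: float) -> str:
    """Place a maximal ATGpr1 score into the ranges named in the text.

    The bin labels are illustrative assumptions, not terms from the specification.
    """
    if not 0.0 <= max_score <= 1.0:
        raise ValueError("an ATGpr1 score is expected to lie between 0 and 1")
    if max_score >= 0.5:
        return "0.5 or higher"
    if max_score >= 0.3:
        return "0.3 or higher and less than 0.5"
    return "less than 0.3"


# F-NT2RP2006580 has a maximal ATGpr1 score of 0.37 according to the text.
print(atgpr1_score_bin(0.37))
```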

[0186] The sequences of F-NT2RP2006580 and R-NT2RP2006580 are shown in SEQ ID NO: 2545 and SEQ ID
NO: 2546, respectively.

EXAMPLE 14 

Full-length sequence analysis and homology search

[0187] The full-length sequence was determined for each selected cDNA clone. The nucleotide sequence determination
was performed mainly by the dye-terminator method using custom-synthesized DNA primers according to the primer
walking procedure (custom-synthesized DNA primers were used for sequencing; the sequencing reaction was performed
with DNA sequencing reagent supplied by PE Biosystems according to the supplier's manual; and the samples were
analyzed in an automatic sequencer made by the same supplier). Sequence determination of some clones was carried
out in the same manner but using a Licor DNA sequencer. Overlapping partial nucleotide sequences obtained
by the above-described method were assembled to determine a full-length nucleotide sequence.
Amino acid sequences were then deduced from the determined full-length nucleotide sequences. However, the amino acid
sequence is not shown for a clone whose coding region was hard to deduce or whose amino acid sequence
has fewer than 100 amino acid residues. SEQ ID NOs corresponding to the respective clones are indicated in Table 370.
[0188] GenBank, Swiss-Prot and UniGene were searched for the determined nucleotide sequences by BLAST analysis.
Matching data of cDNA clones that exhibit higher homology and whose functions are easily predicted based
on the nucleotide sequences and the deduced amino acid sequences were selected from the BLAST analysis matching
data with a P value of 10^-4 or less. The selected matching data are listed herein. However, some clones
did not meet the criteria for judgment, and such BLAST analysis matching data are not shown herein. The results
of homology search indicated in the last part of this specification are as follows.

[0189] Homology search result 1: data obtained by the homology search of Swiss-Prot database for representative
sequences of the 5'-end cluster

[0190] Homology search result 2: homology of representative sequences of the 5'-end cluster to the data in
Swiss-Prot database; the P value is 10^-10 or less

[0191] Homology search result 3: homology of representative sequences of the 5'-end cluster to the data in
Swiss-Prot database; the P value is higher than 10^-10 and 10^-4 or less

[0192] Homology search result 4: homology of representative sequences of the 5'-end cluster to the data in
Swiss-Prot database; the P value is higher than 10^-4 and 1 or less

[0193] Homology search result 5: data obtained by the homology search of Swiss-Prot database for 5'-end sequences
of cDNA clone

[0194] Homology search result 6: data obtained by the homology search of GenBank database
(http://www.ncbi.nlm.nih.gov/web/GenBank/) except for EST and STS sequence data for 5'-end sequences of cDNA clone

[0195] Homology search result 7: data obtained by the homology search of GenBank database
(http://www.ncbi.nlm.nih.gov/web/GenBank/) except for EST and STS sequence data for 3'-end sequences of cDNA clone

[0196] Homology search result 8: data obtained by the homology search of Human UniGene database
(http://www.ncbi.nlm.nih.gov/Unigene) for 5'-end sequences of cDNA clone

[0197] Homology search result 9: data obtained by the homology search of Human UniGene database
(http://www.ncbi.nlm.nih.gov/Unigene) for 3'-end sequences of cDNA clone

[0198] Homology search result 10: result obtained by the homology search for full-length nucleotide sequences and
deduced amino acid sequences

[0199] The P value indicates similarity between two sequences as a score by considering the probability that the two
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sequences are accidentally similar. In general, as the value is lower, the similarity is higher. In general, as the value is 
lower, the homology is higher (Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic local
alignment search tool." J. Mol. Biol. 215:403-410; Gish, W. & States, D.J. (1993) "Identification of protein coding regions
by database similarity search." Nature Genet. 3:266-272).
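
As an illustration of how the P value ranges of homology search results 2 to 4 partition the matching data, a minimal sketch is given below. The function name is hypothetical; only the boundaries 10^-10, 10^-4 and 1 are taken from the text.

```python
def homology_result_number(p_value: float) -> int:
    """Return which of homology search results 2, 3 or 4 a P value falls under.

    Result 2: P <= 1e-10; result 3: 1e-10 < P <= 1e-4; result 4: 1e-4 < P <= 1.
    """
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("a P value is expected to lie between 0 and 1")
    if p_value <= 1e-10:
        return 2
    if p_value <= 1e-4:
        return 3
    return 4


for p in (3e-25, 1e-7, 0.02):
    print(p, "-> homology search result", homology_result_number(p))
```

A lower P value places the match in a lower-numbered result, in keeping with the statement that a lower value indicates higher similarity.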


Example 15. Gene expression analysis with hybridization using high density DNA filter 

[0200] Nylon membrane for DNA spotting was prepared according to the following procedure. E. coli was cultured
in each well of a 96-well plate (in LB medium at 37°C for 16 hours). A sample of each culture was suspended in 10 µl
of sterile water in a well of a 96-well plate. The plate was heated at 100°C for 10 minutes. Then, the boiled samples
were analyzed by PCR. PCR was performed in a 20 µl solution by using TaKaRa PCR Amplification Kit (Takara)
according to the supplier's protocol. Primers used for the amplification of an insert cDNA in a plasmid were a pair of
sequencing primers, ME761FW (5' tacggaagtgttacttctgc 3' / SEQ ID NO: 3591) and ME1250RV (5' tgtgggaggttttttctcta
3' / SEQ ID NO: 3592), or a pair of primers, M13M4 (5' gttttcccagtcacgac 3' / SEQ ID NO: 3593) and M13RV
(5' caggaaacagctatgac 3' / SEQ ID NO: 3594). PCR was performed using a thermal cycler, GeneAmp System 9600 (PE
Biosystems) at 95°C for 5 minutes; at 95°C for 10 seconds and at 68°C for 1 minute for 10 cycles; at 98°C for 20 seconds and
at 60°C for 3 minutes for 20 cycles; and at 72°C for 10 minutes. After the PCR, the 20 µl reaction solution was loaded onto
a 1% agarose gel and fractionated by electrophoresis. DNA on the gel was stained with ethidium bromide to confirm
the amplification of cDNA. When cDNAs were not amplified by PCR, plasmids containing the corresponding insert
cDNAs were prepared by the alkali-extraction method (J. Sambrook, E.F. Fritsch, & T. Maniatis, "Molecular Cloning, A
laboratory manual," 2nd edition, Cold Spring Harbor Laboratory Press, 1989).
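
The thermal cycling conditions described above can be written down as data, for example as in the following sketch. The class and variable names are hypothetical, the temperatures are read as degrees Celsius, and the computed time covers the programmed hold times only; ramping between temperatures is not considered.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int    # hold temperature, read as degrees Celsius
    seconds: int   # hold time


@dataclass(frozen=True)
class Stage:
    steps: tuple   # steps repeated together
    cycles: int = 1


# Conditions as stated in the text for the GeneAmp System 9600.
PCR_PROGRAM = (
    Stage((Step(95, 5 * 60),)),
    Stage((Step(95, 10), Step(68, 60)), cycles=10),
    Stage((Step(98, 20), Step(60, 3 * 60)), cycles=20),
    Stage((Step(72, 10 * 60),)),
)


def total_hold_seconds(program) -> int:
    """Sum the programmed hold times over all stages and cycles."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


print(total_hold_seconds(PCR_PROGRAM) / 60, "minutes of programmed hold time")
```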

[0201] Preparation of the DNA array was carried out by the following procedure. A sample of a DNA solution was added
to each well of a 384-well plate. DNA was spotted onto a nylon membrane (Boehringer) by using a 384-pin tool of
Biomek 2000 Laboratory Automation System (Beckman-Coulter). Specifically, the 384-well plate containing the DNA
was placed under the 384-pin tool. The 384 independent needles were simultaneously dipped into the DNA solution
for DNA deposition. The needles were gently pressed onto a nylon membrane and the DNA deposited at the tips of
the needles was spotted onto the membrane. Denaturation of the spotted DNA and immobilization of the DNA on the nylon
membrane were carried out according to standard methods (J. Sambrook, E.F. Fritsch, & T. Maniatis, "Molecular
Cloning, A laboratory manual," 2nd edition, Cold Spring Harbor Laboratory Press, 1989).

[0202] A probe for hybridization was radioisotope-labeled first strand cDNA. Synthesis of the first strand cDNA was
performed by using Thermoscript(TM) RT-PCR System (GIBCO). Specifically, the first strand cDNA was synthesized
by using 1.5 µg of mRNAs from various human tissues (Clontech), 1 µl of 50 µM Oligo(dT)20 and 50 µCi [α-33P]dATP
according to an attached protocol. Purification of a probe was carried out by using ProbeQuant(TM) G-50 micro column
(Amersham-Pharmacia Biotech) according to an attached protocol. In the next step, 2 units of E. coli RNase H were
added to the reaction mixture. The mixture was incubated at room temperature for 10 minutes, and then 100 µg of
human COT-1 DNA (GIBCO) was added thereto. The mixture was incubated at 97°C for 10 minutes and then was allowed
to stand on ice to give the hybridization probe.

[0203] Hybridization of the radioisotope-labeled probe to the DNA array was performed according to standard methods
(J. Sambrook, E.F. Fritsch, & T. Maniatis, "Molecular Cloning, A laboratory manual," 2nd edition, Cold Spring Harbor
Laboratory Press, 1989). The membrane was washed as follows: the nylon membrane was washed 3 times by
incubating it in Washing solution 1 (2xSSC, 1% SDS) at room temperature (about 26°C) for 20 minutes; then the membrane
was washed 3 times by incubating it in Washing solution 2 (0.1xSSC, 1% SDS) at 65°C for 20 minutes.
[0204] Autoradiography was performed by using an image plate for BAS2000 (Fuji Photo Film Co., Ltd.). Specifically,
the nylon membrane with the probe hybridized thereon was wrapped with a piece of Saran Wrap and brought into tight
contact with the light-sensitive surface of the image plate. The membrane with the image plate was placed in an
imaging cassette for radioisotope and allowed to stand in a dark place for 4 hours. The radioactivity recorded on the
image plate was analyzed by using BAS2000 (Fuji Photo Film Co., Ltd.). The activity was subjected to electronic
conversion and recorded as an image file of the autoradiogram. The signal intensity of each DNA spot was analyzed by
using Visage High Density Grid Analysis Systems (Genomic Solutions Inc.). The signal intensity was converted into
numerical data. The data were taken in duplicate. The reproducibility was assessed by comparing the signal intensities
of the corresponding spots on the duplicated DNA filters that were hybridized to a single DNA probe (Figure 2). In 95%
of all spots, the ratio between the corresponding spots falls within a range of 2 or less, and the correlation coefficient
is r=0.97. Thus, the reproducibility is satisfactory.
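
The two reproducibility measures mentioned above, namely the fraction of spots whose duplicate signals agree within a ratio of 2 and the correlation coefficient, could be computed for example as sketched below. The function name is hypothetical, and it is an assumption that the ratio and the Pearson correlation are taken on the raw (not log-transformed) intensities; the text does not specify this.

```python
import math


def reproducibility(filter_a, filter_b, max_ratio=2.0):
    """Compare signal intensities of corresponding spots on duplicate filters.

    Returns (fraction of spots whose larger/smaller ratio is <= max_ratio,
    Pearson correlation coefficient r).
    """
    if len(filter_a) != len(filter_b) or len(filter_a) < 2:
        raise ValueError("need two equally long lists with at least 2 spots")
    n = len(filter_a)
    within = sum(
        1
        for a, b in zip(filter_a, filter_b)
        if a > 0 and b > 0 and max(a, b) / min(a, b) <= max_ratio
    )
    mean_a = sum(filter_a) / n
    mean_b = sum(filter_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(filter_a, filter_b))
    var_a = sum((a - mean_a) ** 2 for a in filter_a)
    var_b = sum((b - mean_b) ** 2 for b in filter_b)
    r = cov / math.sqrt(var_a * var_b)
    return within / n, r


# Made-up intensities for four spots, for illustration only.
fraction, r = reproducibility([100, 200, 400, 800], [110, 190, 420, 2000])
print(f"fraction within 2-fold: {fraction:.2f}, r = {r:.3f}")
```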

[0205] The detection sensitivity in gene expression analysis was estimated by examining the probe
concentration-dependent increase in the signal intensity of a spot in hybridization using a probe complementary to the DNA spotted on
the nylon membrane. DNA used was PLACE1008092 (the same as DNA deposited in GenBank under Accession
No. AF107253). The DNA array with DNA of PLACE1008092 was prepared according to the above-mentioned method.
The probe used was prepared as follows: mRNA was synthesized in vitro from the clone, PLACE1008092. By using
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this mRNA as a template, radioisotope-labeled first strand cDNA was synthesized in the same manner as described 
above, and the cDNA was used as the probe. In order to synthesize mRNA from PLACE1008092 in vitro, a plasmid
in which the 5' end of the cDNA PLACE1008092 was ligated to the T7 promoter of pBluescript SK(-) was constructed.
Specifically, the PLACE1008092 insert was cut out from pME18SFL3 carrying the cDNA at a DraIII site thereof by XhoI
digestion. The resulting PLACE1008092 fragment was ligated to XhoI-predigested pBluescript SK(-) by using DNA
ligation kit ver.2 (Takara). The in vitro mRNA synthesis from PLACE1008092 inserted into pBluescript SK(-) was carried
out by using Ampliscribe(TM) T7 high yield transcription kit (Epicentre Technologies). Hybridization and the analysis of
signal intensity of each DNA spot were performed by the same methods as described above. When the probe
concentration was 1x10^-7 µg/ml or less, there was no increase of signal intensity proportional to the probe concentration.
Therefore, it was assumed to be difficult to compare the signals with one another in this concentration range. Thus,
the spots with an intensity of 40 or less were uniformly taken as low level signals (Figure 3). Within a probe
concentration range of 1x10^-7 µg/ml to 0.1 µg/ml, the signal was found to increase in a probe concentration-dependent
manner. The detection limit represented as the ratio of the expression level of test mRNA to that of total mRNA in a
sample was 1:100,000.

[0206] Tables 28-184 (also containing clones without description in Examples) show the expression of each cDNA 
in human normal tissues (heart, lung, pituitary gland, thymus, brain, kidney, liver and spleen). The expression levels 
are indicated with numerical values of 0-10,000. Genes that were expressed in at least a single tissue are indicated 
below by the corresponding clone names: 

clone: BNGH41000020, BNGH41000087, BNGH41000091, HEMBA1000300, HEMBA1000443, HEMBA1000462, HEMBA1000477,
HEMBA1000835, HEMBA1001390, HEMBA1002164, HEMBA1002316, HEMBA1002992, HEMBA1003294,
HEMBA1003945, HEMBA1004444, HEMBA1005084, HEMBA1005449, HEMBA1005016, HEMBA1006430,
HEMBA1006749, HEMBA1007013, HEMBB1000309, HEMBB1000567, HEMBB1001530, HEMBB1002041,
HEMBB1002465, MAMMA1000106, MAMMA1000457, MAMMA1000681, MAMMA1000986, MAMMA1001150,
MAMMA1001609, MAMMA1002070, MAMMA1002205, NT2RM1000542, NT2RM2000410, NT2RM2000589,
NT2RM2001941, HEMBA1000875, HEMBA1001563, HEMBA1002167, HEMBA1002421, HEMBA1003047,
HEMBA1003315, HEMBA1004007, HEMBA1004454, HEMBA1005145, HEMBA1005489, HEMBA1006171,
HEMBA1006482, HEMBA1006770, HEMBA1007057, HEMBA1000940, HEMBA1001621, HEMBA1002178,
HEMBA1002524, HEMBA1003072, HEMBA1003392, HEMBA1004067, HEMBA1004505, HEMBA1005230,
HEMBA1005545, HEMBA1006276, HEMBA1006517, HEMBA1006902, HEMBA1007063, HEMBB1000447,
HEMBB1000542, HEMBB1000642, HEMBB1000905, HEMBB1001573, HEMBB1002051, HEMBB1002661,
MAMMA1000118, MAMMA1000473, MAMMA1000706, MAMMA1000994, MAMMA1001284, MAMMA1001615,
MAMMA1002091, MAMMA1002224, NT2RM1000789, NT2RM2000423, NT2RM2000622, NT2RM4000198,
HEMBB1001847, HEMBB1002162, HEMBB1002663, MAMMA1000204, MAMMA1000528, MAMMA1000788,
MAMMA1001043, MAMMA1001310, MAMMA1001634, MAMMA1002095, MAMMA1002586, NT2RM1000855,
NT2RM2000497, NT2RM2000773, NT2RM4000295, HEMBA1000962, HEMBA1002048, HEMBA1002195,
HEMBA1002551, HEMBA1003101, HEMBA1003399, HEMBA1001085, HEMBA1004797, HEMBA1005246,
HEMBA1005698, HEMBA1006311, HEMBA1006544, HEMBA1006912, HEMBA1007291, HEMBB1001026,
HEMBB1001959, HEMBB1002228, HEMBB1002693, MAMMA1000226, MAMMA1000591, MAMMA1000810,
MAMMA1001066, MAMMA1001344, MAMMA1001893, MAMMA1002128, MAMMA1003126, NT2RM1000858,
NT2RM2000514, NT2RM2001126, NT2RM4000444,



NT2RM4000965, NT2RM4000997, NT2RM4001321, NT2RM4001325, NT2RM4001377, NT2RP1000271,
NT2RP1000903, NT2RP2000479, NT2RP2001200, NT2RP2001529, NT2RM4001735, NT2RP1000300,
NT2RP1000981, NT2RP2000533, NT2RP2001223, NT2RP2001538, NT2RM4001768, NT2RP1000325,
NT2RP2000092, NT2RP2000610, NT2RP2001388, NT2RP2001562, NT2RM4001843, NT2RP1000465,
NT2RP2000178, NT2RP2000616, NT2RP2001469, NT2RP2001662, HEMBA1000121, HEMBA1000634,
HEMBA1001228, HEMBA1002131, HEMBA1002227, HEMBA1002767, HEMBA1003120, HEMBA1003487,
HEMBA1004110, HEMBA1004952, HEMBA1005337, HEMBA1005929, HEMBA1006335, HEMBA1006658,
HEMBA1006916, HEMBA1007332, HEMBB1001048, HEMBB1001978, HEMBB1002302, MAMMA1000046,
MAMMA1000403, MAMMA1000614, MAMMA1000814, MAMMA1001094, MAMMA1001418, MAMMA1001901,
MAMMA1002142, NT2RM1000407, NT2RM2000241, NT2RM2000565, NT2RM2001626, NT2RM4000593,
NT2RP1000002, NT2RP1000468, NT2RP2000240, NT2RP2000694, NT2RP2001480, NT2RP2001878,
HEMBA1000275, HEMBA1000713, HEMBA1001296, HEMBA1002163, HEMBA1002239, HEMBA1002985,
HEMBA1003230, HEMBA1003530, HEMBA1004391, HEMBA1005070, HEMBA1005430, HEMBA1005945,
HEMBA1006357, HEMBA1006707, HEMBA1006960, HEMBB1000106, HEMBB1001407, HEMBB1002039,
HEMBB1002427, MAMMA1000102, MAMMA1000449, MAMMA1000652, MAMMA1000881, MAMMA1001141,
MAMMA1001532, MAMMA1001957, MAMMA1002165, NT2RM1000462, NT2RM2000306, NT2RM2000582,
NT2RM2001792, NT2RM4000761, NT2RP1000181, NT2RP1000740, NT2RP2000447, NT2RP2000739,
NT2RP2001514, NT2RP2001903,
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NT2RP2001921, 
NT2RP2002409, NT2RP2002824, NT2RP2003302, NT2RP2003599, NT2RP2004141, NT2RP2004606,
NT2RP2005069, NT2RP2005425, NT2RP2005666, NT2RP2006042, NT2RP3000059, NT2RP3000201,
NT2RP3000645, NT2RP3000818, NT2RP3001159, NT2RP3001592, NT2RP3002015, NT2RP3002409,
NT2RP3002900, NT2RP3003469, NT2RP3003874, NT2RP3004090, NT2RP3004309, NT2RP3004552,
NT2RP4000634, OVARC1000090, OVARC1000331, OVARC1000775, OVARC1001030, OVARC1001569,
PLACE1000231, PLACE1001016, PLACE1001407, PLACE1002080, PLACE1002547, PLACE1003407,
PLACE1003644, PLACE1004166, PLACE1004441, PLACE1004648, PLACE1005410, PLACE1005736,
PLACE1006073, PLACE1007077, PLACE1009067, PLACE1011045, PLACE1011708, PLACE4000455,
THYRO1000866, THYRO1001237, THYRO1001523, THYRO1001725, Y79AA1000270, Y79AA1000967,
Y79AA1001427, Y79AA1001787,



NT2RP2001956, NT2RP2002510, NT2RP2002942, NT2RP2003369, NT2RP2003655, NT2RP2004179,
NT2RP2004648, NT2RP2005163, NT2RP2005463, NT2RP2005774, NT2RP2006092, NT2RP3000063,
NT2RP3000232, NT2RP3000652, NT2RP3000820, NT2RP3001170, NT2RP3001685, NT2RP3002160,
NT2RP3002411, NT2RP3002958, NT2RP3003473, NT2RP3003939, NT2RP3004119, NT2RP3004321,
NT2RP3004557, NT2RP4001877, OVARC1000105, OVARC1000410, OVARC1000853, OVARC1001049,
OVARC1001570, PLACE1000258, PLACE1001100, PLACE1001500, PLACE1002095, PLACE1002726,
PLACE1003428, PLACE1003772, PLACE1004168, PLACE1004482, PLACE1004816, PLACE1005426,
PLACE1005768, PLACE1006079, PLACE1007096, PLACE1009527, PLACE1011116, PLACE1011978,
SKNMC1000014, THYRO1000999, THYRO1001242, THYRO1001529, THYRO1001770, Y79AA1000426,
Y79AA1001013, Y79AA1001430, Y79AA1001793,



NT2RP2002015, NT2RP2002527, NT2RP2002974, NT2RP2003383, NT2RP2003940, NT2RP2004205,
NT2RP2004794, NT2RP2005181, NT2RP2005535, NT2RP2005878, NT2RP2006099, NT2RP3000125,
NT2RP3000304, NT2RP3000676, NT2RP3000838, NT2RP3001195, NT2RP3001738, NT2RP3002281,
NT2RP3002721, NT2RP3003000, NT2RP3003532, NT2RP3003963, NT2RP3004130, NT2RP3004355,
NT2RP3004625, NT2RP4001879, OVARC1000137, OVARC1000439, OVARC1000873, OVARC1001086,
OVARC1001596, PLACE1000442, PLACE1001114, PLACE1001516, PLACE1002153, PLACE1002905,
PLACE1003438, PLACE1003839, PLACE1004199, PLACE1004492, PLACE1004887, PLACE1005539,
PLACE1005815, PLACE1006277, PLACE1007626, PLACE1009982, PLACE1011181, PLACE2000118,
THYRO1000061, THYRO1001063, THYRO1001456, THYRO1001593, THYRO1001803, Y79AA1000521,
Y79AA1001090, Y79AA1001523, Y79AA1001799,



NT2RP2002063, NT2RP2002533, NT2RP2003138, NT2RP2003390, NT2RP2003950, NT2RP2004447,
NT2RP2004837, NT2RP2005247, NT2RP2005541, NT2RP2005887, NT2RP2006269, NT2RP3000148,
NT2RP3000378, NT2RP3000677, NT2RP3000907, NT2RP3001271, NT2RP3001754, NT2RP3002311,
NT2RP3002737, NT2RP3003076, NT2RP3003614, NT2RP3004025, NT2RP3004133, NT2RP3004374,
NT2RP3004640, NT2RP4002187, OVARC1000208, OVARC1000467, OVARC1000916, OVARC1001163,
OVARC1001807, PLACE1000560, PLACE1001183, PLACE1001655, PLACE1002329, PLACE1002911,
PLACE1003460, PLACE1003845, PLACE1004279, PLACE1004520, PLACE1005005, PLACE1005544,
PLACE1005878, PLACE1006443, PLACE1007702, PLACE1010078, PLACE1011236, PLACE2000219,
THYRO1000099, THYRO1001113, THYRO1001457, THYRO1001608, Y79AA1000127, Y79AA1000776,
Y79AA1001272, Y79AA1001530, Y79AA1001803,



NT2RP2002188, NT2RP2002564, NT2RP2003179, NT2RP2003469, NT2RP2004069, NT2RP2004524,
NT2RP2004847, NT2RP2005378, NT2RP2005597, NT2RP2005941, NT2RP30000H, NT2RP3000171,
NT2RP3000436, NT2RP3000721, NT2RP3000921, NT2RP3001388, NT2RP3001858, NT2RP3002324,
NT2RP3002738, NT2RP3003354, NT2RP3003729, NT2RP3004067, NT2RP3004202, NT2RP3004406,
NT2RP3004647, NT2RP4002715, OVARC1000255, OVARC1000529, OVARC1000956, OVARC1001260,
OVARC1001833, PLACE1000912, PLACE1001229, PLACE1001836, PLACE1002374, PLACE1002967,
PLACE1003529, PLACE1003852, PLACE1004282, PLACE1004630, PLACE1005031, PLACE1005569,
PLACE1005927, PLACE1006716, PLACE1008469, PLACE1010251, PLACE1011364, PLACE3000181,
THYRO1000584, THYRO1001128, THYRO1001478, THYRO1001700, Y79AA1000207, Y79AA1000777,
Y79AA1001328, Y79AA1001592, Y79AA1001863,



NT2RP2002232, NT2RP2002721, NT2RP2003210, NT2RP2003593, NT2RP2004108, NT2RP2004556,
NT2RP2005027, NT2RP2005391, NT2RP2005632, NT2RP2006004, NT2RP3000022, NT2RP3000172,
NT2RP3000460, NT2RP3000789, NT2RP3001044, NT2RP3001560, NT2RP3001976, NT2RP3002353,
NT2RP3002836, NT2RP3003448, NT2RP3003849, NT2RP3004083, NT2RP3004294, NT2RP3004481,
NT2RP4000108, NT2RP4002750, OVARC1000313, OVARC1000553, OVARC1000995, OVARC1001336,
OVARC1001991, PLACE1000927, PLACE1001340, PLACE1001918, PLACE1002518, PLACE1003163,
PLACE1003598, PLACE1004078, PLACE1004305, PLACE1004637, PLACE1005383, PLACE1005725,
PLACE1006071, PLACE1006809, PLACE1008985, PLACE1010445, PLACE1011516, PLACE4000354,
THYRO1000795, THYRO1001205, THYRO1001495, THYRO1001702, Y79AA1000226, Y79AA1000888,
Y79AA1001426, Y79AA1001727, Y79AA1002022, Y79AA1002213, Y79AA1002373, Y79AA1002376,
Y79AA1002381.






[0207] Genes that were expressed in all the tissues tested are indicated below by the corresponding clone names: 
clone: BNGH41000020, HEMBA1000300, HEMBA1001390, HEMBA1002239, HEMBA1002316, HEMBA1004007,
HEMBA1004067, HEMBA1005145, HEMBA1005230, HEMBA1005929, HEMBA1006357, HEMBA1006482,
HEMBB1000567, HEMBB1001847, HEMBB1001978, MAMMA1000614, MAMMA1000652, MAMMA1000810,
MAMMA1000814, MAMMA1001066, MAMMA1001094, MAMMA1001284, MAMMA1001310, MAMMA1001634,
MAMMA1002165, MAMMA1002205, MAMMA1002224, NT2RM1000462, NT2RM1000855, NT2RM1000858,
NT2RM2000423, NT2RM4000761, NT2RM4000997, NT2RP1000271, NT2RP1000325, NT2RP1000465,
NT2RP2001538, NT2RP2001662, NT2RP2001903, NT2RP2002015, NT2RP2002188, NT2RP2002409,
NT2RP2002510, NT2RP2002533, NT2RP2004556, NT2RP2004794, NT2RP2004847, NT2RP2005069,
NT2RP2005163, NT2RP2005535, NT2RP2006269, NT2RP3000171, NT2RP3000645, NT2RP3000838,
NT2RP3001271, NT2RP3001754, NT2RP3003076, NT2RP3003354, NT2RP3003614, NT2RP3004640,
NT2RP3004647, OVARC1000090, OVARC1000208, OVARC1000553, OVARC1000995, OVARC1001030,
OVARC1001049, PLACE1000231, PLACE1000258, PLACE1001516, PLACE1002080, PLACE1002911,
PLACE1003598, PLACE1004648, PLACE1006443, PLACE1008469, PLACE1011708, PLACE2000118,
THYRO1001128, THYRO1001205, THYRO1001242, THYRO1001803, Y79AA1000207, Y79AA1001013,
Y79AA1001272, Y79AA1001328, Y79AA1001793, Y79AA1001863, Y79AA1002022, Y79AA1002376.
[0208] Genes that were expressed at low levels in any of the tissues tested are indicated below by the corresponding
clone names:
clone: HEMBA1000006, HEMBA1000128, HEMBA1000349, HEMBA1000590, HEMBA1000671,
HEMBA1000732, HEMBA1000745, HEMBA1000907, HEMBA1001184, HEMBA1001221, HEMBA1001272,
HEMBA1001878, HEMBA1001886, HEMBA1002420, HEMBA1004250, HEMBA1004785, HEMBA1004971,
HEMBA1005913, HEMBA1006299, HEMBA1006572, HEMBB1000407, HEMBB1000668, HEMBB1000679,
HEMBB1002120, HEMBB1002245, MAMMA1000141, HEMBA1001297, HEMBA1003732, HEMBA1005522,
HEMBB1000276, HEMBB1001547, MAMMA1001623, MAMMA1001978, MAMMA1002080, MAMMA1002087,



NT2RM1000580, NT2RM2001939, NT2RM4000587, NT2RP1000448, NT2RP1001020, NT2RP2000663,
NT2RP2001755, NT2RP2002674, NT2RP2004495, NT2RP2006134, NT2RP3000871, NT2RP3002286,
NT2RP3002887, NT2RP3004075, NT2RP4002451, OVARC1001132, OVARC1002058, PLACE1001123,
PLACE1001788, PLACE1003737, PLACE1005250, PLACE1005745, PLACE1006786, PLACE1007591,
PLACE1008359, PLACE1009196, PLACE1009279, PLACE1010713, PLACE3000213, THYRO1000580,
THYRO1001102, NT2RM1000899, NT2RM2000632, NT2RM2001643, NT2RM4000100, NT2RM4000115,
NT2RM4000284, NT2RM4000648, NT2RM4002352, NT2RP1000050, NT2RP1000551, NT2RP1000579,
NT2RP1000613, NT2RP1001031, NT2RP1001563, NT2RP2000394, NT2RP2000712, NT2RP2000818,
NT2RP2000903, NT2RP2001769, NT2RP2001817, NT2RP2001915, NT2RP2002976, NT2RP2003042,
NT2RP2003545, NT2RP2004670, NT2RP2005514, NT2RP2005883, NT2RP2005994, NT2RP3000169,
NT2RP3000444, NT2RP3001061, NT2RP3001240, NT2RP3002448, NT2RP3002571, NT2RP3003527,
NT2RP3003535, NT2RP4000962, NT2RP4001001, OVARC1000003, OVARC1000275, OVARC1000298,
OVARC1001222, OVARC1001338, OVARC1001607, PLACE1000033, PLACE1000740, PLACE1001401,
PLACE1001464, PLACE1001949, PLACE1002355, PLACE1004450, PLACE1004519, PLACE1005601,
PLACE1005660, PLACE1006208, PLACE1006219, PLACE1007028, PLACE1007040, PLACE1007881,
PLACE1007971, PLACE1008657, PLACE1008716,



NT2RP2006512, NT2RP3001012, NT2RP3002342, NT2RP3002983, NT2RP3004345, OVARC1002178,
PLACE1001231, PLACE1001795, PLACE1004028, PLACE1005519, PLACE1006093, PLACE1006959,
PLACE1007845, PLACE1008549, PLACE1009546, PLACE1010784, PLACE1009600, PLACE1010827,
SKNMC1000004, SKNMC1000082, PLACE1009735, PLACE1010968, THYRO1000036, THYRO1000678,
THYRO1000776, THYRO1000846, THYRO1001266,



HEMBA1 003497, 
HEMBA1 004982, 
HEMBA1 006724, 
HEMBB1 000881, 
MAMMA1 000496 
MAMMA1 002234, 
NT2PM2001818 
NT2RM4000326 
NT2RP1 000239 
NT2RP1 000679 
NT2RP2000514 
NT2RP2001276 
NT2RP2001948 
NT2RP2003664 



NT2RP3000481 
NT2RP3001322 
NT2RP3002664, 
NT2RP3003559, 
NT2RP4001009, 
OVARC1 000307, 
OVARC1001725 
PLACE1000914 
PLACE1001536 
PLACE1003135, 
PLACE1 005003, 
PLACE1 005669, 
PLACE1 006290, 
PLACE1 007081, 
PLACE1 008282, 
PLACE1 008744 

PLACE1010011 
PLACE1011407 
THYRO1000196 
THYR01 000956 



HEMBA1003602, 
HEMBA1005267, 
HEMBA1007241, 
HEMBB1001200, 
MAMMA1001237, 
MAMMA1 002633, 
NT2RM2001902, 
NT2RM4000417, 
NT2RP1 000261, 
NT2RP1001004, 
NT2RP2000649, 
NT2RP2001495, 
NT2RP2002304, 
NT2RP2003931, 

NT2RP3000616, 
NT2RP3001542, 
NT2RP3002790, 
NT2RP3004000, 
NT2RP4001467, 
OVARC1 000811, 
OVARC1001727, 
PLACE1 000986, 
PLACE1001564, 
PLACE1 003573, 
PLACE1 005239, 
PLACE1 005682, 
PLACE1006515, 
PLACE1 007296, 
PLACE1 008297, 
PLACE1 008984, 

PLACE1010081, 
PLACE1011824, 
THYR01 000400, 
THYRO1001071, 
Y79AA1 000959, 



THYRO1001327, THYRO1001471, Y79AA1000876,
Y79AA1001056, Y79AA1001062, Y79AA1001264, Y79AA1001795.

[0209] Genes exhibiting characteristic features in their expression were selected by statistical analysis of these
data. Two examples are shown below to describe the selection of genes whose expression varies greatly among
tissues. The β-actin gene is frequently used as a control in gene expression analysis. Genes whose expression
varies greatly among tissues as compared with that of the β-actin gene were determined as follows. Specifically, the sum of
squared deviations was calculated for the signal intensity of β-actin observed in each tissue and divided by 7
degrees of freedom to determine a variance $S_a^2$. Next, the sum of squared deviations was calculated for the signal intensity
of a compared gene in each tissue and divided by 7 degrees of freedom to determine a variance $S_b^2$. Taking
the variance ratio $F$ as $F = S_b^2 / S_a^2$, genes with a significance level of 5% or more in the F distribution were extracted. The genes
extracted are indicated below by the corresponding clone names: BNGH41000020, NT2RM4000761,
Y79AA1002376.

[0210] The expression of the OVARC1000037 gene {heterogeneous nuclear ribonucleoprotein (hnRNP)} varies little among tissues.
Genes whose expression varies greatly among tissues as compared with that of the OVARC1000037 gene were
determined as follows. Specifically, the sum of squared deviations was calculated for the signal intensity of β-actin observed in
each tissue and divided by 7 degrees of freedom to determine a variance $S_a^2$. Next, the sum of squared deviations
was calculated for the signal intensity of a gene to be compared observed in each tissue and divided by 7 degrees
of freedom to determine a variance $S_b^2$. Taking the variance ratio $F$ as $F = S_b^2 / S_a^2$, genes with a significance level of 5%
or more in the F distribution were extracted. The genes extracted are indicated below by the corresponding clone names:
BNGH41000020, HEMBA1000300, OVARC1001030, NT2RM4000761, PLACE1000231, HEMBA1002316,
NT2RP1000325, NT2RP1000271, PLACE1004648, HEMBA1005145, HEMBA1005929, NT2RP2002510,
NT2RP2001538, NT2RP2002409, NT2RP2002188, NT2RP2001903, NT2RP2002533, NT2RP2002015,
NT2RP2006269, NT2RP2004837, NT2RP2004205, NT2RP2005378, HEMBA1006357, HEMBB1000567,
NT2RP2003940, NT2RP2004794, HEMBA1006912, NT2RP2004556, NT2RP2005163, NT2RP3000838,
NT2RP3001271, PLACE2000118, NT2RP3000645, NT2RP3003076, HEMBB1002693, MAMMA1000046,
NT2RP3003354, THYRO1001205, MAMMA1000614, MAMMA1000652, MAMMA1000810, THYRO1001242,
MAMMA1001066, MAMMA1002224, MAMMA1001634, MAMMA1001094, MAMMA1002205, NT2RM1000855,
NT2RM1000858, Y79AA1002376, NT2RM2000423.

[0211] Thus, characteristic features in the expression of a gene are illustrated by comparing and statistically analyzing
the expression of many genes.
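The variance-ratio selection described in [0209] and [0210] can be sketched as follows. This is a minimal illustration, not the procedure actually used to produce the clone lists: the function names are hypothetical, eight tissue measurements per gene are assumed because the text divides by 7 degrees of freedom, the test is assumed to be one-sided, and the critical value 3.79 is the upper 5% point of F with (7, 7) degrees of freedom taken from a standard F table.

```python
from statistics import variance

# Upper 5% critical value of F with (7, 7) degrees of freedom,
# taken from a standard F table (an assumption, not stated in the text).
F_CRIT_5PCT_7_7 = 3.79


def variance_ratio(reference, candidate):
    """Return F = S_b^2 / S_a^2 for a candidate gene against a reference gene.

    statistics.variance divides the sum of squared deviations by n - 1,
    which is 7 degrees of freedom for eight tissue measurements.
    """
    s_a2 = variance(reference)
    s_b2 = variance(candidate)
    return s_b2 / s_a2


def varies_more_than_reference(reference, candidate, f_crit=F_CRIT_5PCT_7_7):
    """True when the candidate's variance exceeds the reference's at the 5% level."""
    if len(reference) != 8 or len(candidate) != 8:
        raise ValueError("the tabulated critical value assumes 8 tissues per gene")
    return variance_ratio(reference, candidate) > f_crit


if __name__ == "__main__":
    # Made-up signal intensities, for illustration only.
    control_gene = [100, 102, 98, 101, 99, 100, 103, 97]
    compared_gene = [100, 120, 80, 110, 90, 100, 130, 70]
    print(variance_ratio(control_gene, compared_gene))
    print(varies_more_than_reference(control_gene, compared_gene))
```

The reference list would hold the signal intensities of the control gene (β-actin in the first example) and the candidate list those of the gene being compared, one value per tissue.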

Analysis of disease-associated genes 

[0212] Non-enzymatic protein glycation is believed to be a cause of a variety of chronic diabetic complications.
Accordingly, genes whose expression is elevated or decreased in a glycated protein-specific manner in endothelial
cells are associated with diabetic complications caused by glycated proteins. Vascular endothelial cells are affected
by glycated proteins present in blood. Reaction products of non-enzymatic protein glycation include the Amadori compound
(glycated protein) as a mildly glycated protein and the advanced glycation endproduct as a heavily glycated protein. Hence,
a survey was carried out for genes whose expression levels vary depending on the presence of these glycated
proteins in endothelial cells. The mRNAs were extracted from endothelial cells that were cultured in the presence or
absence of glycated protein. The mRNAs were converted into radiolabeled first-strand cDNAs for preparing probes.
The probes were hybridized to the above-mentioned DNA array. The signal of each DNA spot was detected by BAS2000
and analyzed by ArrayGauge (Fuji Photo Film Co., Ltd.).

[0213] Advanced glycation endproduct of bovine serum albumin was prepared as follows: bovine serum albumin
(BSA; Sigma) was incubated in a phosphate buffer solution containing 50 mM glucose at 37°C for 8 weeks; and the
resulting brownish BSA was dialyzed against a phosphate buffer solution.

[0214] Human normal pulmonary arterial endothelial cells (Cell Applications) were cultured in an Endothelial Cell
Growth Medium (Cell Applications). The culture dishes (Falcon) containing the cells were incubated in a CO2 incubator (37°C,
5% CO2, in a humid atmosphere). When the cells had grown to confluence in the dish, 250 µg/ml of bovine serum
albumin (Sigma), glycated bovine serum albumin (Sigma) or advanced glycation endproduct of bovine serum albumin
was added thereto and the cells were incubated for 33 hours. The mRNA was extracted from the cells by using a
FastTrack(TM) 2.0 kit (Invitrogen). The hybridization probe was labeled by using the mRNA according to
the same procedure as described above.

[0215] Table 185 shows the expression level of each cDNA in human pulmonary arterial endothelial cells cultured
in a medium containing bovine serum albumin (Sigma), glycated bovine serum albumin (Sigma) or advanced glycation
endproduct of bovine serum albumin. Genes whose expression was detected in the endothelial cells are as follows:
BNGH41000020, BNGH41000087, HEMBA1000275, HEMBA1000300, HEMBA1000477, HEMBA1000634,
HEMBA1000671, HEMBA1000713, HEMBA1000745, HEMBA1000835, HEMBA1000875, HEMBA1000940,
HEMBA1001390, HEMBA1002131, HEMBA1002163, HEMBA1002164, HEMBA1002195, HEMBA1002227,
HEMBA1002239, HEMBA1002420, HEMBA1002767, HEMBA1002992, HEMBA1003047, HEMBA1003120,
HEMBA1003294, HEMBA1003315, HEMBA1003602, HEMBA1003945, HEMBA1004007, HEMBA1004067,
HEMBA1004971, HEMBA1005145, HEMBA1005267,
HEMBA1005945, HEMBA1006171, HEMBA1006299,
HEMBA1006482, HEMBA1006658, HEMBA1006724,
HEMBA1007063, HEMBB1000447, HEMBB1000642,
HEMBB1001573, HEMBB1001847, HEMBB1001978,
HEMBB1002693, MAMMA1000102, MAMMA1000106,
MAMMA1000614, MAMMA1000652, MAMMA1000810,
MAMMA1001066, MAMMA1001237, MAMMA1001284,
MAMMA1001893, MAMMA1001901, MAMMA1001957,
MAMMA1002205, MAMMA1002224, MAMMA1002633,
NT2RM1000789, NT2RM1000855, NT2RM1000858,






NT2RM2000241 

NT2RM2000622 

NT2RM4000587 

NT2RM4001377 

NT2RP1 000271 

NT2RP2000092 

NT2RP2000649 

NT2RP2001480 

NT2RP2001903 

NT2RP2002409, 

NT2RP2002824, 

NT2RP2003593 

NT2RP2004108 

NT2RP2004606 

NT2RP2005069 

NT2RP2005541 

NT2RP2005878, 

NT2RP3000171 

NT2RP3000652 

NT2RP3000907, 

NT2RP3001592 

NT2RP3002353, 

NT2RP3002836, 

NT2RP3003532, 

NT2RP3004075, 

NT2RP3004294, 

NT2RP3004640, 

NT2RP4001467, 

OVARC1 000003, 

OVARC1000313 

OVARC1 000853, 

OVARC1001049, 

OVARC1001222 

OVARC1001607, 

PLACE1 000740 

PLACE1001229 

PLACE1001918 

PLACE1 002547 

PLACE1 003407 

PLACE1 003839 

PLACE1 004279 

PLACE1 004887 

PLACE1005519 

PLACE1 005745 

PLACE1 006290 

PLACE1 007626 

PLACE1009196, 



NT2RM2000306 
NT2RM2000773 
NT2RM4000593 
NT2RM4001735 
NT2RP1 000300 
NT2RP2000240 
NT2RP2000663 
NT2RP2001495 
NT2RP2001948 
NT2RP2002510 
NT2RP2002942 
NT2RP2003599 
NT2RP2004179 
NT2RP2004648 
NT2RP2005163 



NT2RP2005632, NT2RP2005774 



NT2RP2006099 
NT2RP3000201 
NT2RP3000676 
NT2RP3001159 
NT2RP3001738 
NT2RP3002409 
NT2RP3002900 
NT2RP3003614 
NT2RP3004083, 
NT2RP3004309 
NT2RP3004647 
NT2RP4001877, 
OVARC1 000090 
OVARC1 000331 
OVARC1 000873 



OVARC1001086, OVARC1001132, 



OVARC1001260 
OVARC1001807, 
PLACE1 000927, 
PLACE1001340 
PLACE1 002080 
PLACE1 002726, 
PLACE1 003460, 
PLACE1 003845 
PLACE1 004282 
PLACE1 005003 
PLACE1 005544, 
PLACE1 005768, 
PLACE1 006443 
PLACE1 007702 
PLACE1 009600, 



NT2RM2000410, 
NT2RM4000100, 
NT2RM4000761, 
NT2RM4001768, 
NT2RP1 000325, 
NT2RP2000479, 
NT2RP2000712, 
NT2RP2001529, 
NT2RP2001956, 
NT2RP2002527, 
NT2RP2002976, 
NT2RP2003664, 
NT2RP2004205, 
NT2RP2004794, 
NT2RP2005247, 



NT2RP2006134, 
NT2RP3000232, 
NT2RP3000721, 
NT2RP3001195, 
NT2RP3001754, 
NT2RP3002448, 
NT2RP3003076, 
NT2RP3003939, 
NT2RP3004090, 
NT2RP3004345, 
NT2RP4000108, 
NT2RP4001879, 
OVARC1000105, 
OVARC1 000410, 
OVARC1000916, 



OVARC1001336, 
OVARC1001991, 
PLACE1001016, 
PLACE1001407, 
PLACE1 002095, 
PLACE1 002905, 
PLACE1 003573, 
PLACE1 004078, 
PLACE1 004441, 
PLACE1 005005, 
PLACE1 005660, 
PLACE1005815, 
PLACE1 006809, 
PLACE1 007845, 
PLACE1 003735, 



HEMBA1 005337, 
HEMBA1 006335, 
HEMBA1 006770, 
HEMBB1 000905, 
HEMBB1 002041, 
MAMMA1 000204, 
MAMMA1000814, 
MAMMA1001344, 
MAMMA1002087, 
MAMMA1003126, 

NT2RM2000423, 

NT2RM4000198, 

NT2RM4000997, 

NT2RM4001843, 

NT2RP1 000465, 

NT2RP2000533, 

NT2RP2000903, 

NT2RP2001538, 

NT2RP2002015, 

NT2RP2002533, 

NT2RP2003138, 

NT2RP2003931 

NT2RP2004495, 

NT2RP2004837, 

NT2RP2005378, 



NT2RP2006269, 
NT2RP3000436, 
NT2RP3000818, 
NT2RP3001240, 
NT2RP3002015, 
NT2RP3002721 
NT2RP3003354, 
NT2RP3003963, 
NT2RP3004119 
NT2RP3004374, 
NT2RP4000634, 
NT2RP4002187, 
OVARC1000137 
OVARC1 000439 
OVARC1 000956 

OVARC1001569 
PLACE1 000231 
PLACE1001100, 
PLACE1001464, 
PLACE1 002329, 
PLACE1002911 
PLACE1 003598, 
PLACE1004166, 
PLACE1 004482, 
PLACE1 005031 
PLACE1 005669, 
PLACE1 006073, 
PLACE1 006959, 
PLACE1 008282, 
PLACE1010081 



HEMBA1 005698, 
HEMBA1 006357, 
HEMBA1006912, 
HEMBB1001026, 
HEMBB1 002427, 
MAMMA1 000403, 
MAMMA1 000881, 
MAMMA1001615, 
MAMMA1 002095, 
NT2RM1 000462, 

NT2RM2000582 
NT2RM4000284 
NT2RM4001321 
NT2RP1 000002, 
NT2RP1 000740, 
NT2RP2000610, 
NT2RP2001276 
NT2RP2001662 
NT2RP2002188 
NT2RP2002564 
NT2RP2003210, 
NT2RP2003940, 
NT2RP2004524, 
NT2RP2004847, 
NT2RP2005425, 

NT2RP2006512 

NT2RP3000460, 

NT2RP3000820, 

NT2RP3001271 

NT2RP3002324 

NT2RP3002737 

NT2RP3003448 

NT2RP3004025 

NT2RP3004130, 

NT2RP3004557 

NT2RP4001001 

NT2RP4002451 

OVARC1 000208, 

OVARC1 000553, 

OVARC1 000995, 

OVARC1001570, 
PLACE1 000258 
PLACE1001114 
PLACE1001788, 
PLACE1 002374, 
PLACE1 002967, 
PLACE1 003644, 
PLACE1004168 
PLACE 1004492 
PLACE1 005250, 
PLACE1 005725, 
PLACE1 006208 
PLACE1 007028, 
PLACE1 008469, 
PLACE1010251 



HEMBA1005929,
HEMBA1006430,
HEMBA1006960,
HEMBB1001048,
HEMBB1002663,
MAMMA1000449,
MAMMA1000986,
MAMMA1001634,
MAMMA1002165,
NT2RM1000580,



NT2RM2000589, 
NT2RM4000444, 
NT2RM4001325, 
NT2RP1000181, 
NT2RP1 000981, 
NT2RP2000616, 
NT2RP2001388, 
NT2RP2001878, 
NT2RP2002232, 
NT2RP2002721, 
NT2RP2003390, 
NT2RP2004069, 
NT2RP2004556, 
NT2RP2005027, 
NT2RP2005535, 

NT2RP3000011, 
NT2RP3000645, 
NT2RP3000838, 
NT2RP3001388, 
NT2RP3002342, 
NT2RP3002738, 
NT2RP3003473, 
NT2RP3004067, 
NT2RP3004133, 
NT2RP3004625, 
NT2RP4001009, 
NT2RP4002715, 
OVARC 1000298, 
OVARC 1000775, 
OVARC1001030, 



OVARC1001596,
PLACE1000442,
PLACE1001123,
PLACE1001795,
PLACE1002518,
PLACE1003163,
PLACE1003772,
PLACE1004199,
PLACE1004637,
PLACE1005410,
PLACE1005736,
PLACE1006219,
PLACE1007296,
PLACE1008657,
PLACE1010713,
PLACE1011116, PLACE1011181,
PLACE1011236, PLACE1011516, PLACE1011708, PLACE1011824, PLACE1011978, PLACE2000118,
PLACE3000181, SKNMC1000004, SKNMC1000014, THYRO1000584, THYRO1000866, THYRO1001113,
THYRO1001128, THYRO1001205, THYRO1001242, THYRO1001495, THYRO1001523, THYRO1001529,
THYRO1001593, THYRO1001608, THYRO1001702, THYRO1001725, THYRO1001770, THYRO1001803,
Y79AA1000117, Y79AA1000207, Y79AA1000226, Y79AA1000270, Y79AA1000426, Y79AA1000777,
Y79AA1000876, Y79AA1000888, Y79AA1000959, Y79AA1001013, Y79AA1001056, Y79AA1001090,
Y79AA1001264, Y79AA1001272, Y79AA1001328, Y79AA1001427, Y79AA1001430, Y79AA1001530,
Y79AA1001592, Y79AA1001727, Y79AA1001793, Y79AA1001799, Y79AA1001863, Y79AA1002022,
Y79AA1002213, Y79AA1002373, Y79AA1002376, Y79AA1002381.

[0216] Signal ratios of EC_AGE_BSA to EC_BSA and of EC_glycated_BSA to EC_BSA were calculated for each
gene. Genes with high signal ratios were selected. When calculating the ratio of a signal value of 40 or less to
one of more than 40, the signal value of 40 or less was, for convenience, taken as 40 instead of the real value. When the ratio
EC_AGE_BSA/EC_BSA is 2 or more, expression of the genes exhibiting such a ratio is expected to be elevated by
advanced glycation endproduct of bovine serum albumin. The higher the value is, the higher the gene expression level
is. When the ratio EC_AGE_BSA/EC_BSA ranges from 0.5 to 2, expression of the genes exhibiting such a ratio is
expected to be unaffected by advanced glycation endproduct of bovine serum albumin. When the ratio EC_AGE_BSA/EC_BSA
is less than 0.5, expression of the genes exhibiting such a ratio is expected to be decreased by
advanced glycation endproduct of bovine serum albumin. The lower the value is, the lower the gene expression level is.
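The ratio rule of [0216] can be sketched as below. The function names are hypothetical, and flooring both signals at 40 is a simplifying assumption: the text only describes the case where one value is 40 or less and the other is above 40, so the result for two low signals (a ratio of 1.0) goes beyond what the text states.

```python
# Signals at or below this value are replaced by it before taking the ratio.
SIGNAL_FLOOR = 40.0


def floored_ratio(treated, control, floor=SIGNAL_FLOOR):
    """Ratio of treated to control signal, with low signals raised to the floor."""
    return max(treated, floor) / max(control, floor)


def classify(ratio):
    """Map a signal ratio to the three outcomes described in the text."""
    if ratio >= 2.0:
        return "elevated"
    if ratio < 0.5:
        return "decreased"
    return "unaffected"


if __name__ == "__main__":
    # Made-up (EC_AGE_BSA, EC_BSA) signal pairs, for illustration only.
    for age_bsa, bsa in [(200.0, 20.0), (30.0, 100.0), (90.0, 80.0)]:
        r = floored_ratio(age_bsa, bsa)
        print(age_bsa, bsa, round(r, 3), classify(r))
```

The same two functions apply unchanged to the EC_glycated_BSA/EC_BSA ratio.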

[0217] Clones with an EC_AGE_BSA/EC_BSA ratio of 2 or higher are as follows: NT2RP2001538, NT2RP4001001 and
Y79AA1000967.

[0218] These cDNAs are associated with diabetes.

Analysis of genes associated with neural cell differentiation

[0219] Genes involved in neural cell differentiation are useful for treating neurological diseases. It is possible that
genes whose expression levels vary in response to induction of cellular differentiation in neural cells are associated
with neurological diseases.

[0220] A survey was performed for genes whose expression levels vary in response to induction of differentiation
(stimulation by retinoic acid (RA)) in cultured cells of a neural strain, NT2.

[0221] The NT2 cells were treated basically according to the supplier's instruction manual. "Undifferentiated NT2 cells"
means NT2 cells successively cultured in Opti-MEM I (GIBCO-BRL; catalog No. 31985) containing 10% (v/v) fetal
bovine serum and 1% (v/v) penicillin-streptomycin (GIBCO BRL). "NT2 cells cultured in the presence of retinoic acid"
means the cells obtained by transferring undifferentiated NT2 cells into a retinoic acid-containing medium, which
consisted of D-MEM (GIBCO BRL; catalog No. 11965), 10% (v/v) fetal bovine serum, 1% (v/v) penicillin-streptomycin and
10 µM retinoic acid (GIBCO-BRL), and then culturing them successively therein for 5 weeks. "NT2 cells that were
cultured in the presence of retinoic acid and then further cultured in the presence of cell-division inhibitor added" means
NT2 cells obtained by transferring NT2 cells cultured in the presence of retinoic acid for 5 weeks into a cell-division
inhibitor-containing medium, which consisted of D-MEM (GIBCO BRL; catalog No. 11965), 10% (v/v) fetal bovine serum,
1% (v/v) penicillin-streptomycin, 10 µM retinoic acid, 10 µM FudR (5-fluoro-2'-deoxyuridine; GIBCO BRL), 10 µM Urd
(uridine; GIBCO BRL) and 1 µM araC (cytosine β-D-arabinofuranoside; GIBCO BRL), and then culturing them successively
for 2 weeks. Each type of cell was treated with trypsin and then harvested. Total RNA was extracted from
the cells by using a S.N.A.P.(TM) Total RNA Isolation kit (Invitrogen). The probe used for hybridization was labeled
by using 10 µg of the total RNA according to the same methods as described above. The data were obtained
in triplicate (n=3). The signal values representing the gene expression level in the cells in the presence of stimulation
for inducing differentiation were compared with those in the absence of the stimulation. The comparison was performed
by a two-sample t-test. Clones with a significant difference in the signal distribution were selected
under the condition of p<0.05. In this analysis, clones with a difference can be statistically detected even when the
signals are low. Accordingly, clones with a signal value of 40 or less were also assessed for the selection.

[0222] Tables 186-365 show the expression level of each cDNA in undifferentiated NT2 cells, NT2 cells cultured in
the presence of RA, and NT2 cells that were cultured in the presence of RA and that were further cultured in the
presence of cell-division inhibitor added.

[0223] Averaged signal values ($M_1$, $M_2$) and sample variances ($s_1^2$, $s_2^2$) were calculated for each gene in each of
the cells, and then the pooled sample variance $s^2$ was obtained from the sample variances of the two types of cells
to be compared. The t-values were determined according to the following formula: $t = (M_1 - M_2) / \left( s \sqrt{1/3 + 1/3} \right)$. When the
determined t-value was greater than the t-value at P, which means the probability of significance level, of 0.05 or 0.01
in the t-distribution table with 4 degrees of freedom, a difference was judged to exist in the expression level of
the gene between the two types of cells at p<0.05 or p<0.01, respectively. The tables also include information on
an increase (+) or decrease (-) in the expression level of a gene in the treated cells when the level is compared with
that of untreated undifferentiated cells.
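The pooled two-sample t-test of [0223] can be sketched as follows for triplicate data. Several points are assumptions rather than statements from the text: the function names are hypothetical, the treated cells are taken as sample 1, the magnitude of t is compared with two-sided critical values (2.776 and 4.604 for 4 degrees of freedom, from a standard t table), and the sign of t supplies the (+) or (-) direction.

```python
from math import sqrt
from statistics import mean, variance

# Two-sided critical values of t with 4 degrees of freedom (standard t table).
# Whether the original analysis was one- or two-sided is not stated; two-sided is assumed.
T_CRIT_4DF = {"p<0.05": 2.776, "p<0.01": 4.604}


def pooled_t(sample1, sample2):
    """t = (M1 - M2) / (s * sqrt(1/n1 + 1/n2)) with pooled variance s^2."""
    n1, n2 = len(sample1), len(sample2)
    s2 = ((n1 - 1) * variance(sample1) + (n2 - 1) * variance(sample2)) / (n1 + n2 - 2)
    if s2 == 0:
        raise ValueError("pooled variance is zero; t is undefined")
    return (mean(sample1) - mean(sample2)) / (sqrt(s2) * sqrt(1 / n1 + 1 / n2))


def compare(treated, untreated):
    """Return (direction, level): '+' or '-', and 'p<0.01', 'p<0.05' or None."""
    if len(treated) != 3 or len(untreated) != 3:
        raise ValueError("critical values assume n = 3 per group (4 degrees of freedom)")
    t = pooled_t(treated, untreated)
    direction = "+" if t > 0 else "-"
    if abs(t) > T_CRIT_4DF["p<0.01"]:
        return direction, "p<0.01"
    if abs(t) > T_CRIT_4DF["p<0.05"]:
        return direction, "p<0.05"
    return direction, None


if __name__ == "__main__":
    # Made-up triplicate signal values, for illustration only.
    print(compare([110, 120, 130], [50, 60, 70]))
    print(compare([100, 110, 120], [95, 105, 115]))
```

With three replicates per group, the factor sqrt(1/n1 + 1/n2) reduces to the sqrt(1/3 + 1/3) that appears in the formula above.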

[0224] Clones whose expression levels were increased by RA are as follows: HEMBA1000121, HEMBA1000275,
HEMBA1000300, HEMBA1000634, HEMBA1000671, HEMBA1000875, HEMBA1001184, HEMBA1001390,
HEMBA1001886, HEMBA1002163, HEMBA1002227, HEMBA1002420, HEMBA1002421, HEMBA1003072,
HEMBA1003120, HEMBA1003294, HEMBA1003497, HEMBA1004007, HEMBA1004110, HEMBA1004391,
HEMBA1004444, HEMBA1005230, HEMBA1005246, HEMBA1005267, HEMBA1005489, HEMBA1005913,
HEMBA1006299, HEMBA1006357, HEMBA1006517, HEMBA1006544, HEMBA1006658, HEMBA1006749,
HEMBA1007063, HEMBA1007241, HEMBB1000447, HEMBB1000542, HEMBB1000567, HEMBB1000642,
HEMBB1000668, HEMBB1001026, HEMBB1001847, HEMBB1002051, HEMBB1002120, HEMBB1002228,
HEMBB1002693, MAMMA1000106, MAMMA1000141, MAMMA1000473, MAMMA1000528, MAMMA1000810,
MAMMA1000881, MAMMA1001634, MAMMA1001957, MAMMA1002205, MAMMA1002224, NT2RM2000423,
NT2RM2000497, NT2RM2000582, NT2RM2001126, NT2RM2001902, NT2RM4000198, NT2RM4000284,
NT2RM4000593, NT2RM4001321, NT2RP1000002, NT2RP1000050, NT2RP1000181, NT2RP1000261,
NT2RP1000465, NT2RP1000468, NT2RP1000579



NT2RP1 000679, NT2RP2000092, 

NT2RP2000694, NT2RP2000903, NT2RP2001 388, 

NT2RP2002304, NT2RP2002721 , NT2RP2002824, 
NT2RP20031 79, NT2RP2003302, 
NT2RP2004069, NT2RP20041 08, 

NT2RP2005247, NT2RP2005425, NT2RP2005463, 

NT2RP2005774, NT2RP2005878, NT2RP2005883, 

NT2RP30001 25, NT2RP30001 71 , 



NT2RP2003383, 
NT2RP2004524, 



NT2RP200061 0, NT2RP2000663, 

NT2RP2001 878, NT2RP2001 01 5, 

NT2RP2002974, NT2RP2002976, 

NT2RP2003664, NT2RP2003940, 

NT2RP2004670, NT2RP2005069, 

NT2RP2005535, NT2RP2005541 , 

NT2RP2006099, NT2RP2006134, NT2RP3000011 

NT2RP3000460, NT2RP3000481 , NT2RP3000652, NT2RP3000677, NT2RP3000818, NT2RP3000820, 



NT2RP2000479, 
NT2RP2001538, 
NT2RP2002942, 
NT2RP2003469, 
NT2RP2004556, 
NT2RP2005514, 
NT2RP2005887, 
NT2RP3000232, 



NT2RP3001044, NT2RP3001061 , NT2RP3001170, 
NT2RP3001 542, NT2RP3001 592, NT2RP3001 976, 



NT2RP3001240, NT2RP3001322, NT2RP3001388, 
NT2RP3002790, NT2RP3002900, NT2RP3002983, 



NT2RP3003000, NT2RP3003354, NT2RP3003532, NT2RP3003729, NT2RP3003874, NT2RP3003939, 



NT2RP3004025, NT2RP3004083, NT2RP3004090, NT2RP3004130, 
NT2RP3004640, NT2RP4000108, 



NT2RP3004202, 
NT2RP4002715, 



NT2RP3004294, 
OVARC1 000090, 



NT2RP4000634, NT2RP4002451 , 
OVARC1 000208, OVARC1 000275, OVARC1 000553, OVARC1 000775, OVARC1 000853, OVARC1 000873, 
OVARC1000916, OVARC1 000995, OVARC1001030, OVARC1001049, OVARC1001132, OVARC1001596, 



OVARC1 0021 78, PLACE1 000258, PLACE 1 000442, PLACE1 000927, 
PLACE1001123, PLACE1001795, PLACE1002518, PLACE1 002547, 
PLACE1 003428, PLACE1 003644, PLACE1 003839, PLACE1 004078, 
PLACE1 005669, PLACE1 005682, PLACE 1005736, PLACE1 005768, PLACE1005815, 

PLACE1008984, 



PLACE1000986, PLACE1001100,
PLACE1002967, PLACE1003407,
PLACE1004441, PLACE1004450,



PLACE1 006208, PLACE1 007296, PLACE1 007626, PLACE1 008282, 
PLACE1 01 0445, PLACE1 01 1 708, PLACE1 01 1 978, PLACE4000455, 
THYR01 000580, THYR01 000776, THYR01 000999, THYRO1001063, THYRO1001128, THYRO1001205, 



PLACE1 006073, 
PLACE1 008985, 
SKNMC1 000004, THYR01 000036, 



THYR01 001 523, THYR01 001 725, 
Y79AA1 001 056, Y79AA1 001 062, 



THYR01 001 770, Y79AA1 000207, Y79AA1 000226, 
Y79AA1 001 090, Y79AA1 001 727, Y79AA1 00221 3, 



THYRO1001327, 
Y79AA1 000270, 
Y79AA1 002381. 

[0225] Clones whose expression levels were decreased by RA are as follows: BNGH41000020, HEMBA1005070,
NT2RP2005027, NT2RP3003473, Y79AA1002376.

[0226] Clones whose expression levels were increased by RA/inhibitor are as follows:
HEMBA1000128, HEMBA1000875, HEMBA1001390, HEMBA1002163, HEMBA1002227, HEMBA1002421,
HEMBA1004391, HEMBA1004454, HEMBA1004785, HEMBA1005913, HEMBA1006171, HEMBA1006299,
HEMBA1006335, HEMBA1006544, HEMBA1007241, HEMBB1000447, HEMBB1000668, MAMMA1000994,
MAMMA1001344, NT2RM2000582, NT2RP1001004, NT2RP2000663, NT2RP2000694, NT2RP2000903,
NT2RP2001388, NT2RP2002674, NT2RP2002974, NT2RP2003383, NT2RP2004069, NT2RP2004606,
NT2RP2004837, NT2RP2005069, NT2RP2005425, NT2RP2005463, NT2RP2005541, NT2RP2005883,



NT2RP2005887, NT2RP3000460, 
NT2RP3002721 , NT2RP3002738, 
NT2RP3004294, NT2RP3004640, 



NT2RP3000838, NT2RP3001044, NT2RP3001240, NT2RP3001388, 
NT2RP3003469, NT2RP3004083, NT2RP30041 30, NT2RP3004202, 
NT2RP4000108, NT2RP4002451, NT2RP4002715, OVARC1 000275, 



OVARC1 000467, OVARC1 000553, OVARC1 000853, OVARC1 000873, OVARC1000916, OVARC1 000995, 



OVARC1001030, OVARC1 001222, 
PLACE1 001 1 23, PLACE 1 001 407, 
PLACE1 003407, PLACE1 003644, 



OVARC1 001 596, OVARC1 002058, OVARC1 0021 78, PLACE1 000927, 

PLACE1 001 464, PLACE 1 001 564, PLACE1 001 795, PLACE1 002547, 

PLACE1 003845, PLACE1 004441 , PLACE1 004482, PLACE1 00541 0, 

PLACE1005601, PLACE1005725, PLACE1005736, PLACE1006093, PLACE1006219, PLACE1006290,
PLACE1006716, PLACE1007296, PLACE1007626, PLACE1008359, PLACE1010968, PLACE1011364,
PLACE1011824, THYRO1000678, THYRO1000776, THYRO1000999, THYRO1001113, THYRO1001237,
THYRO1001523, Y79AA1000226, Y79AA1000888, Y79AA1001430.

[0227] Clones whose expression levels were decreased by RA/inhibitor are as follows: HEMBA1000349,
HEMBA1001297, HEMBA1001878, HEMBA1005070, HEMBA1006482, HEMBB1001959, NT2RM2001939,
NT2RP1000981, NT2RP2001469, NT2RP3003473, OVARC1001132, PLACE1001655, Y79AA1000127,
Y79AA1002381.

[0228] Clones whose expression levels were increased in the presence of both RA and RA/inhibitor are as follows:
HEMBA1000875, HEMBA1001390, HEMBA1002163, HEMBA1002227, HEMBA1002421, HEMBA1004391,
HEMBA1005913, HEMBA1006299, HEMBA1006544, HEMBA1007241, HEMBB1000447, HEMBB1000668,
NT2RM2000582, NT2RP2000663, NT2RP2000694, NT2RP2000903, NT2RP2001388, NT2RP2002974,
NT2RP2003383, NT2RP2004069, NT2RP2005069, NT2RP2005425, NT2RP2005463, NT2RP2005541,
NT2RP2005883, NT2RP2005887, NT2RP3000460, NT2RP3001044, NT2RP3001240, NT2RP3001388,
NT2RP3004083, NT2RP3004130, NT2RP3004202, NT2RP3004294, NT2RP3004640, NT2RP4000108,
NT2RP4002451, NT2RP4002715, OVARC1000275, OVARC1000553, OVARC1000853, OVARC1000873,
OVARC1000916, OVARC1000995, OVARC1001030, OVARC1001596, OVARC1002178, PLACE1000927,
PLACE1001123, PLACE1001795, PLACE1002547, PLACE1003407, PLACE1003644, PLACE1004441,
PLACE1005736, PLACE1007296, PLACE1007626, THYRO1000776, THYRO1000999, THYRO1001523,
Y79AA1000226.

[0229] Clones whose expression levels were decreased in the presence of both RA and RA/inhibitor are as follows:
HEMBA1005070 and NT2RP3003473.

[0230] These are neurological disease-associated clones.



Analysis of rheumatoid arthritis-associated genes 



[0231] The onset of rheumatoid arthritis is thought to involve the proliferation of synovial cells covering the inner
surfaces of the joint cavity and an inflammatory reaction resulting from the action of cytokines produced by leukocytes
infiltrating the joint synovial tissues (Rheumatism Information Center, http://www.rheuma-net.or.jp/). Recent studies
have also revealed that tumor necrosis factor (TNF)-α participates in the onset (Current Opinion in Immunology 1999,
11, 657-662). When the expression of a gene responds to the action of TNF on synovial cells, the gene
is considered to be involved in rheumatoid arthritis.

[0232] A survey was performed for genes whose expression levels vary in response to TNF-α in the primary
cell culture of synovial tissue. The primary cultured cells of the smooth muscle (Cell Applications) were grown to
confluence in a culture dish, and then human TNF-α (Boehringer-Mannheim) was added thereto at a final concentration of 10
ng/ml. The culture was continued for a further 24 hours.

[0233] Total RNA was extracted from the cells by using a S.N.A.P.(TM) Total RNA Isolation kit (Invitrogen). The probe
used for hybridization was labeled by using 10 µg of the total RNA according to the same methods as
described above. The data were obtained in triplicate (n=3). The signal values representing the gene expression
level in the cells in the presence of TNF stimulation were compared with those in the absence of the stimulation. The
comparison was performed by a two-sample t-test. Clones with a significant difference in the signal
distribution were selected under the condition of p<0.05. In this analysis, clones with a difference can be statistically
detected even when the signals are low. Accordingly, clones with a signal value of 40 or less were also assessed for
the selection.

[0234] Table 366 shows the expression level of each cDNA in synovial cells cultured in the absence or presence of 
TNF. 

[0235] Averaged signal values ($M_1$, $M_2$) and sample variances ($s_1^2$, $s_2^2$) were calculated for each gene in each of
the cells, and then the pooled sample variance $s^2$ was obtained from the sample variances of the two types of cells
to be compared. The t-values were determined according to the following formula: $t = (M_1 - M_2) / \left( s \sqrt{1/3 + 1/3} \right)$. When the
determined t-value was greater than the t-value at P, which means the probability of significance level, of 0.05 or 0.01
in the t-distribution table with 4 degrees of freedom, a difference was judged to exist in the expression level of
the gene between the two types of cells at p<0.05 or p<0.01, respectively. The tables also include information on
an increase (+) or decrease (-) in the expression level of a gene in the stimulated cells when the level is compared
with that of unstimulated cells.

[0236] Clones whose expression levels were elevated by TNF-α are as follows:

BNGH41000020, HEMBA1000349, HEMBA1000634, HEMBA1000671, HEMBA1000835, HEMBA1000962,
HEMBA1002178, HEMBA1002195, HEMBA1002239, HEMBA1002420, HEMBA1002524, HEMBA1002992,
HEMBA1003315, HEMBA1003392, HEMBA1003487, HEMBA1003602, HEMBA1004067, HEMBA1004797,
HEMBA1005337, HEMBA1005489, HEMBA1006916, HEMBB1000668, HEMBB1000905, HEMBB1001547,
HEMBB1001573, HEMBB1002041, HEMBB1002663, MAMMA1000652, MAMMA1000810, MAMMA1001634,



MAMMA1 002091, 
NT2RP1 000740, 
NT2RP2005163, 
NT2RP3001976, 
NT2RP3004647, 
OVARC1 000275, 
PLACE1001100, 
PLACE1 003644, 
PLACE1 006809, 
Y79AA1 000270, 



MAMMA1 002234, NT2RM2000306, 
NT2RP2000694, NT2RP2001921 , 



NT2RP2005463, 
NT2RP3003473, 
NT2RP4000108, 
OVARC1 000439, 
PLACE1001464, 
PLACE1004519, 
PLACE1008716, 
Y79AA1 000876, 



NT2RP2006134, 
NT2RP3003874, 
NT2RP4001001, 
OVARC1001607, 
PLACE1001500, 
PLACE1 005031, 
PLACE1010081, 
Y79AA1001013, 



NT2RM4000417, 
NT2RP2002527, 
NT2RP3000171,
NT2RP3004090, 
NT2RP4001877, 

PLACE1 000740, 
PLACE1001918, 
PLACE1 005410, 
THYRO1001770, 
Y79AA1001264, 






Y79AA1001430, Y79AA1001530, Y79AA1001799.

[0237] Clones whose expression levels were decreased by TNF-α are as follows:
NT2RM4000326, NT2RP1000300, NT2RP2000514, NT2RP2001755,
NT2RP3002790. These are rheumatoid arthritis-associated clones.

EXAMPLE 16 



NT2RP1 000002, 
NT2RP2004495, 
NT2RP3000652, 
NT2RP3004294, 
OVARC1 000090, 
PLACE1 000927, 
PLACE1 002095, 
PLACE1 005736, 
Y79AA1000127, 
Y79AA1001272, 



NT2RP1000181, 
NT2RP2004606, 
NT2RP3001195, 
NT2RP3004557, 
OVARC1000105, 
PLACE1001016, 
PLACE1 002547, 
PLACE1006219, 
Y79AA1 000207, 
Y79AA1001328, 



NT2RP2006042, NT2RP3000481 , 






Search for a signal sequence, transmembrane region and functional domain in deduced amino acid sequences 

[0238] The amino acid sequences deduced from the full-length nucleotide sequences were examined to predict the
presence of a signal sequence at their amino termini as well as the presence of a transmembrane region. The amino
acid sequences were also searched for a protein functional domain (motif). The examinations for a signal sequence
at the amino terminus, for a transmembrane region and for a functional domain were performed by using PSORT [K.
Nakai & M. Kanehisa, Genomics, 14:897-911 (1992)], SOSUI [T. Hirokawa et al., Bioinformatics, 14:378-379 (1998)]
(Mitsui Knowledge Industry Co., Ltd.) and Pfam (http://www.sanger.ac.uk/Software/Pfam/index.shtml), respectively.
When the presence of a signal sequence at the amino terminus or of a transmembrane region was predicted in the amino
acid sequence by PSORT or SOSUI, the protein was predicted to be a secretory protein or a transmembrane protein.
When the amino acid sequence matched a functional domain in the Pfam search, the function
of the protein can be predicted based on the matching data, for example, by referring to the functional categories in
PROSITE (http://www.expasy.ch/cgi-bin/prosite-nst.pl). The functional domain search can also be performed by using
PROSITE instead of Pfam.

[0239] Search results obtained by using the respective software programs are indicated below. 

[0240] Clones whose deduced amino acid sequences were predicted to have signal sequences by PSORT search
are as follows:

HEMBA1002421, HEMBA1003101, HEMBA1004110,
HEMBB1001573, HEMBB1001978, HEMBB1002162,
MAMMA1000118, MAMMA1000457, MAMMA1001043,
MAMMA1002165, MAMMA1002633, NT2RM2000241,
NT2RP1001563,



HEMBA1000713, 
HEMBA1 006902, 
HEMBB1 002427, 
MAMMA1001893 
NT2RM2001941, 
NT2RP2003383, 
NT2RP2004524, 
NT2RP3003076, 
OVARC1 000298, 
PLACE1001564, 
PLACE1 005250, 
PLACE 1006277, 
PLACE1011236, 
THYRO1001327, 
Y79AA1 000959, 



HEMBA1 002420, 
HEMBB1001530, 
MAMMA1000102, 
MAMMA1 002070, 
NT2RM4001325, 
NT2RP2003593, 
NT2RP2005463, 
NT2RP3003532, 
OVARC1 000811, 
PLACE1 002095, 
PLACE1 005383, 
PLACE1 006809, 
SKNMC1 000004, 
THYR01 001523, 
Y79AA1001013, 



NT2RP2003655, 
NT2RP3000460, 
NT2RP3004133, 

PLACE1 000231, 
PLACE1 002374, 
PLACE1 005410, 
PLACE1 007626, 
SKNMC1 000014, 

THYRO1001702, 
Y79AA1001264, 



NT2RP2001495, 
NT2RP2003664, 
NT2RP3001012, 
NT2RP3004309, 
PLACE1 000740, 
PLACE1 003839, 
PLACE1 005544, 
PLACE1 008359, 
THYR01 000099, 
THYRO1001725, 
Y79AA1001328, 



NT2RP2002063, 
NT2RP2004179, 
NT2RP3001858, 
NT2RP4001467, 
PLACE1001183, 
PLACE1001482, 
PLACE1 005569, 
PLACE1 009067, 
THYRO1000196, 
Y79AA1 000426, 
Y79AA1001427, 



HEMBA1 006707, 
HEMBB1002245, 
MAMMA1001344, 
NT2RM2000410, 
NT2RP2002721, 
NT2RP2004205, 
NT2RP3002836, 
NT2RP4002451, 
PLACE1001536, 
PLACE1 005005, 
PLACE1 006093, 
PLACE1010251, 
THYRO1001237, 
Y79AA1 000521, 
Y79AA1001430, 



Y79AA1001795, Y79AA1001803, Y79AA1 002022, 

[0241] Clones whose deduced amino acid sequences were predicted to have transmembrane regions by SOSUI
search are as follows:

BNGH41000091, HEMBA1000121, HEMBA1000349, HEMBA1000477, HEMBA1000713,
HEMBA1000940, HEMBA1000962, HEMBA1001221, HEMBA1001228, HEMBA1001621, HEMBA1002167,
HEMBA1002195, HEMBA1002227, HEMBA1002421, HEMBA1003101, HEMBA1003392, HEMBA1003530,
HEMBA1003732, HEMBA1003945, HEMBA1004391, HEMBA1004454, HEMBA1004797, HEMBA1004982,
HEMBA1005449, HEMBA1005522, HEMBA1005545, HEMBA1005698, HEMBA1006171, HEMBA1006299,
HEMBA1006311, HEMBA1006335, HEMBA1006357, HEMBA1006430, HEMBA1006724, HEMBA1006960,
HEMBB1000407, HEMBB1000447, HEMBB1000567, HEMBB1000679, HEMBB1000905, HEMBB1001026,
HEMBB1001407, HEMBB1001573, HEMBB1001978, HEMBB1002041,
HEMBB1002427, HEMBB1002693, MAMMA1000102, MAMMA1000106,
MAMMA1000204, MAMMA1000226, MAMMA1000457, MAMMA1000473,
MAMMA1000810, MAMMA1000986, MAMMA1001043, MAMMA1001141,
MAMMA1001893, MAMMA1001957, MAMMA1001978, MAMMA1002070,
MAMMA1002633, NT2RM1000580, NT2RM1000855, NT2RM1000858,
NT2RM2001626, NT2RM2001939, NT2RM2001941, NT2RM4000444,
NT2RM4000997, NT2RM4001325, NT2RM4001735, NT2RM4001768,
NT2RP1000181, NT2RP1000261, NT2RP1000300, NT2RP1000448,
NT2RP1000981, NT2RP1001563, NT2RP2000479, NT2RP2000533, NT2RP2000649,



NT2RP2000663, NT2RP2000694, NT2RP2000818, NT2RP2000903,
NT2RP2001495, NT2RP2001915, NT2RP2001956, NT2RP2002188,
NT2RP2002533, NT2RP2002721, NT2RP2002824, NT2RP2002942,
NT2RP2003390, NT2RP2003469, NT2RP2003593, NT2RP2003655,
NT2RP2004179, NT2RP2004205, NT2RP2004495, NT2RP2004524,
NT2RP2004648, NT2RP2004794, NT2RP2005163, NT2RP2005181,
NT2RP2005666, NT2RP2005883, NT2RP2005994, NT2RP2006004,
NT2RP2006580, NT2RP3000169, NT2RP3000171, NT2RP3000304,
NT2RP3000721, NT2RP3000818, NT2RP3000907, NT2RP3000921,
NT2RP3001240, NT2RP3001271, NT2RP3001322, NT2RP3001388,
NT2RP3001650, NT2RP3001738, NT2RP3002015, NT2RP3002311,
NT2RP3002790, NT2RP3002836, NT2RP3002900, NT2RP3002958,
NT2RP3003532, NT2RP3003535, NT2RP3003614, NT2RP3004025,
NT2RP3004090, NT2RP3004130, NT2RP3004294, NT2RP3004309,
NT2RP3004481, NT2RP3004552, NT2RP4001001, NT2RP4001009,
NT2RP4002187, NT2RP4002451, NT2RP4002750, OVARC1000003,



OVARC1000439, OVARC1000553, OVARC1001030, OVARC1001336,
OVARC1001570, PLACE1000231, PLACE1000560, PLACE1000740, PLACE1000912, PLACE1000914,
PLACE1000927, PLACE1001016, PLACE1001183, PLACE1001231, PLACE1001401, PLACE1001407,
PLACE1001464, PLACE1001536, PLACE1001564, PLACE1001655, PLACE1001836, PLACE1001918,
PLACE1001949, PLACE1002518, PLACE1002726, PLACE1002967, PLACE1003573, PLACE1003737,
PLACE1003839, PLACE1003845, PLACE1003852, PLACE1004279, PLACE1004282, PLACE1004441,
PLACE1004637, PLACE1004648, PLACE1004816, PLACE1004887, PLACE1005003, PLACE1005005,
PLACE1005410, PLACE1005544, PLACE1005569, PLACE1005660, PLACE1005725, PLACE1005745,
PLACE1005927, PLACE1006290, PLACE1006443, PLACE1006959, PLACE1007096, PLACE1007296,
PLACE1007626, PLACE1007881, PLACE1008359, PLACE1008469, PLACE1008716, PLACE1008985,
PLACE1009196, PLACE1009279, PLACE1009527, PLACE1009546, PLACE1009600, PLACE1010011,
PLACE1010078, PLACE1010445, PLACE1010713, PLACE1010784, PLACE1010968, PLACE1011236,
PLACE1011516, PLACE3000181, THYRO1000400, THYRO1000678, THYRO1000776, THYRO1000956,
THYRO1001102, THYRO1001113, THYRO1001205, THYRO1001237, THYRO1001242, THYRO1001266,
THYRO1001327, THYRO1001478, THYRO1001523, THYRO1001641, THYRO1001702, THYRO1001725,
Y79AA1000207, Y79AA1000226, Y79AA1000270, Y79AA1000521, Y79AA1000888, Y79AA1001013,
Y79AA1001212, Y79AA1001264, Y79AA1001328, Y79AA1001426, Y79AA1001427, Y79AA1001727,
Y79AA1001787, Y79AA1001795, Y79AA1001803, Y79AA1002058,
Y79AA1002129, Y79AA1002213, Y79AA1002373,

[0242] The names of clones whose deduced amino acid sequences were predicted to have functional domains by Pfam
search, and the names of the matched functional domains, are shown below. When multiple functional domains matched
a clone, each domain name is given, separated by a double slash (//).
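Entries in this "clone//domain//domain" notation can be split mechanically. The following is a minimal sketch; the function name is hypothetical, and it assumes that no domain name itself contains a double slash (single slashes, as in "FAD/NAD-binding domain", are left untouched).

```python
def parse_domain_entry(entry: str) -> tuple[str, list[str]]:
    """Split one 'CLONE//domain[//domain...]' entry into the clone name
    and the list of matched domain names."""
    clone, *domains = [field.strip() for field in entry.split("//")]
    return clone, domains


if __name__ == "__main__":
    entry = ("HEMBA1001221//EGF-like domain"
             "//Kazal-type serine protease inhibitor domain")
    clone, domains = parse_domain_entry(entry)
    print(clone)
    for name in domains:
        print("  ", name)
```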



HEMBB1002162, 
MAMMA1000118, 
MAMMA1 000591 
MAMMA1001237, 
MAMMA1 002091 
NT2RM2000410 
NT2RM4000587, 
NT2RM4002352 
NT2RP1 000551 



NT2RP2001200, 
NT2RP2002232 
NT2RP2002976, 
NT2RP2003664, 
NT2RP2004556, 
NT2RP2005463, 
NT2RP2006269, 
NT2RP3000460 
NT2RP3001159, 
NT2RP3001560 
NT2RP3002342, 
NT2RP3003000 
NT2RP3004075, 
NT2RP3004345 
NT2RP4001467 
OVARC1000105, 



HEMBB1 002245, 
MAMMA1000141, 
MAMMA1 000681, 
MAMMA1001344, 
MAMMA1 002095, 
NT2RM2000565, 
NT2RM4000648, 
NT2RP1 000050, 
NT2RP1000613, 

NT2RP2001276, 
NT2RP2002527, 
NT2RP2003042, 
NT2RP2003950, 
NT2RP2004606, 
NT2RP2005597, 
NT2RP2006512, 
NT2RP3000616, 
NT2RP3001195, 
NT2RP3001592, 
NT2RP3002411, 
NT2RP3003354, 
NT2RP3004083, 
NT2RP3004406, 
NT2RP4001879, 
OVARC 1000307, 



HEMBA1000006//Src homology domain 3
HEMBA1000128//SCP-like extracellular Proteins
HEMBA1000349//ABC transporters
HEMBA1000462//RNA recognition motif, (aka RRM, RBD, or RNP domain)
HEMBA1000590//EGF-like domain//von Willebrand factor type A domain
HEMBA1000671//Zinc finger, C2H2 type
HEMBA1000732//EGF-like domain
HEMBA1000940//Connexin
HEMBA1001221//EGF-like domain//Kazal-type serine protease inhibitor domain
HEMBA1001621//7 transmembrane receptor (rhodopsin family)
HEMBA1001878//WD domain, G-beta repeats
HEMBA1002048//Zinc finger, C2H2 type
HEMBA1002167//Carboxylesterases
HEMBA1002551//WD domain, G-beta repeats
HEMBA1002992//Ubiquitin family
HEMBA1003047//CUB domain
HEMBA1003120//Zinc finger, C2H2 type
HEMBA1003230//EGF-like domain
HEMBA1003392//Low-density lipoprotein receptor domain class A
HEMBA1003497//Zinc finger, C2H2 type
HEMBA1004250//Cadherin
HEMBA1004391//Fibronectin type III domain//IG superfamily
HEMBA1004454//4 transmembrane segments integral membrane proteins
HEMBA1004785//'chromo' (CHRromatin Organization Modifier) domain
HEMBA1005246//Zinc finger, C2H2 type
HEMBA1005267//Ank repeat
HEMBA1005545//7 transmembrane receptor (rhodopsin family)
HEMBA1005929//Eukaryotic protein kinase domain
HEMBA1005945//Mitochondrial carrier proteins
HEMBA1006572//Zinc finger, C2H2 type
HEMBA1006707//EGF-like domain//von Willebrand factor type A domain
HEMBA1006749//EGF-like domain//von Willebrand factor type A domain
HEMBA1006770//RNA recognition motif, (aka RRM, RBD, or RNP domain)
HEMBA1006902//EGF-like domain//von Willebrand factor type A domain
HEMBB1000106//Zinc finger, CCHC class
HEMBB1000668//WD domain, G-beta repeats
HEMBB1000881//Thrombospondin type 1 domain
HEMBB1000905//WD domain, G-beta repeats
HEMBB1002041//EGF-like domain//Kazal-type serine protease inhibitor domain
HEMBB1002245//IG superfamily
HEMBB1002302//Zinc finger, CCHC class
HEMBB1002465//Acyl-CoA dehydrogenases
HEMBB1002661//Helix-loop-helix DNA-binding domain
MAMMA1000204//C2 domain
MAMMA1000457//FAD/NAD-binding domain in oxidoreductases
MAMMA1000681//7 transmembrane receptor (rhodopsin family)
MAMMA1000881//Eukaryotic protein kinase domain//Protein kinase C terminal domain
MAMMA1001150//Phorbol esters / diacylglycerol binding domain//Eukaryotic protein kinase domain
MAMMA1001310//WD domain, G-beta repeats
MAMMA1001532//Zinc finger, C2H2 type
MAMMA1001615//Helix-loop-helix DNA-binding domain
MAMMA1002070//Kringle domain
MAMMA1002080//Ras family (contains ATP/GTP binding P-loop)
MAMMA1002095//E1-E2 ATPases
MAMMA1002165//Insulin-like growth factor binding proteins
NT2RM1000789//HMG (high mobility group) box
NT2RM1000855//eubacterial secY protein
NT2RM1000899//Mitochondrial carrier proteins
NT2RM2000589//PH (pleckstrin homology) domain
NT2RM2000632//Helicases conserved C-terminal domain
NT2RM2001792//Fibrinogen beta and gamma chains, C-terminal globular domain
NT2RM2001902//Eukaryotic protein kinase domain
NT2RM2001939//7 transmembrane receptor (rhodopsin family)
NT2RM2001941//7 transmembrane receptor (rhodopsin family)
NT2RM4000284//Class I Histocompatibility antigen, domains alpha 1 and 2
NT2RM4000326//Zinc finger, C2H2 type
NT2RM4000417//C2 domain
Y79AA1001427//FAD/NAD-binding domain in oxidoreductases
Y79AA1001523//Bromodomain//Zinc finger, C3HC4 type (RING finger)
Y79AA1001530//Tubulin
Y79AA1001727//IG superfamily
Y79AA1001787//E1-E2 ATPases
Y79AA1001799//Mitochondrial carrier proteins
Y79AA1002022//IG superfamily
Y79AA1002381//Eukaryotic protein kinase domain



EXAMPLE 17

Functional categories based on the full-length nucleotide sequences



[0243] Prediction of the functions of the proteins encoded by the clones, and their categorization, were performed based
on the results of the homology search (see homology search result 10) of the GenBank, Swiss-Prot and UniGene
databases for the full-length nucleotide sequences of 826 clones, as well as on the results of the domain search (see
Example 16) of the deduced amino acid sequences encoded by the full-length nucleotide sequences.
(HEMBA1005337, NT2RM1000407, NT2RM2001767, and NT2RP3003939 were excluded because of the absence of
full-length sequence.)

[0244] There are 611 clones that presumably encode proteins belonging to any of the categories of secretory and/or
membrane proteins, glycoprotein-associated proteins, signal transduction-associated proteins, transcription-associated
proteins and disease-associated proteins.

[0245] The clones presumably encoding proteins categorized into secretory and/or membrane proteins are those
which matched the full-length sequences of Swiss-Prot database with the keywords "growth factor", "cytokine", "hormone",
"signal", "transmembrane", "membrane", "extracellular matrix", "receptor", "G-protein coupled receptor", "ionic channel",
"voltage-gated channel", "calcium channel", "cell adhesion", "collagen" or "connective tissue"; those which
matched the data suggesting that the proteins are secretory and/or membrane proteins; or those which matched the
full-length sequences of GenBank or UniGene database with similar description; and, further, those predicted to have
an N-terminal signal sequence or a transmembrane region as a result of domain search for the amino acid sequences
deduced from the full-length nucleotide sequences.

[0246] The clones presumably encoding proteins categorized into glycoprotein-associated proteins are those which
matched the full-length sequences of Swiss-Prot database with the keyword "glycoprotein"; those which matched the
data suggesting that the proteins are glycoproteins; or those which matched the full-length sequences of GenBank or
UniGene database.

[0247] The clones presumably encoding proteins categorized into signal transduction-associated proteins are those
which matched the full-length sequences of Swiss-Prot database with the keywords "serine/threonine-protein kinase",
"tyrosine-protein kinase" or "SH3 domain"; those which matched the data suggesting that the proteins are signal
transduction-associated proteins (for example, "ADP-ribosylation factor"); or those which matched the full-length
sequences of GenBank or UniGene database with similar description.

[0248] The clones presumably encoding proteins categorized into transcription-associated proteins are those which
matched the full-length sequences of Swiss-Prot database with the keywords "transcription regulation", "zinc finger"
or "homeobox"; those which matched the data suggesting that the proteins are transcription-associated proteins; or
those which matched the full-length sequences of GenBank or UniGene database with similar description.
[0249] The clones presumably encoding proteins categorized into disease-associated proteins are those which
matched the full-length sequences of Swiss-Prot database with the keywords "disease mutation" or "syndrome"; those
which matched the data suggesting that the proteins are disease-associated proteins; or those which matched the
full-length sequences of Swiss-Prot database and GenBank or UniGene database where the matched sequences are
those of genes or proteins which had been deposited in the database of Online Mendelian Inheritance in Man (OMIM)
(http://www.ncbi.nlm.nih.gov/Omim/), which is a database of human genes and diseases.
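The Swiss-Prot keyword criteria of the five categories can be sketched as a lookup. This is only an illustration under stated assumptions: the function and table names are hypothetical, matching is done on whole keywords (case-insensitively), and the other criteria described above (matching data suggesting a category, GenBank or UniGene descriptions, domain predictions, OMIM entries) are not modeled.

```python
# Keyword lists copied from the category definitions in the text.
CATEGORY_KEYWORDS = {
    "secretory and/or membrane": [
        "growth factor", "cytokine", "hormone", "signal", "transmembrane",
        "membrane", "extracellular matrix", "receptor",
        "G-protein coupled receptor", "ionic channel",
        "voltage-gated channel", "calcium channel", "cell adhesion",
        "collagen", "connective tissue",
    ],
    "glycoprotein-associated": ["glycoprotein"],
    "signal transduction-associated": [
        "serine/threonine-protein kinase", "tyrosine-protein kinase",
        "SH3 domain",
    ],
    "transcription-associated": [
        "transcription regulation", "zinc finger", "homeobox",
    ],
    "disease-associated": ["disease mutation", "syndrome"],
}


def categorize(hit_keywords: list[str]) -> list[str]:
    """Return every category whose keyword list shares at least one
    whole keyword with the keywords of the matched database entry."""
    found = {kw.strip().lower() for kw in hit_keywords}
    return [
        category
        for category, keywords in CATEGORY_KEYWORDS.items()
        if any(kw.lower() in found for kw in keywords)
    ]


if __name__ == "__main__":
    # Keywords of a made-up database hit, for illustration only.
    print(categorize(["Zinc finger", "Disease mutation"]))
```

Because a protein may satisfy the criteria of several categories, the sketch returns a list rather than a single label.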

[0250] The following 437 clones were categorized into secretory and/or membrane proteins.

BNGH41000020, BNGH41000087, BNGH41000091, HEMBA1000121, HEMBA1000128, HEMBA1000349,
HEMBA1000477, HEMBA1000590, HEMBA1000713, HEMBA1000732, HEMBA1000745, HEMBA1000835,
HEMBA1000940, HEMBA1000962, HEMBA1001221, HEMBA1001228, HEMBA1001621, HEMBA1002131,
HEMBA1002163, HEMBA1002167, HEMBA1002178, HEMBA1002195, HEMBA1002227, HEMBA1002420,
HEMBA1002421, HEMBA1002767, HEMBA1003047, HEMBA1003101, HEMBA1003230, HEMBA1003392,
HEMBA1003530, HEMBA1003602, HEMBA1003732, HEMBA1003945, HEMBA1004110, HEMBA1004250,
HEMBA1004391, HEMBA1004444, HEMBA1004454, HEMBA1004505, HEMBA1004797, HEMBA1004982,
HEMBA1005070, HEMBA1005449, HEMBA1005522, HEMBA1005545, HEMBA1005698, HEMBA1005945,
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HEMBA1006171, 

HEMBA1 006482, 

HEMBA1 007241, 

HEMBB1001026, 

HEMBB1001978, 

MAMMA1000102 

MAMMA1 000457 

MAMMA1 000986 

MAMMA1001418 

MAMMA1 002070 

MAMMA1 002633 

NT2RM1 000858 

NT2RM2001626 

NT2RM4000417, 

NT2RM4000997 

NT2RP1 000050, 

NT2RP1 000551 

NT2RP2000533, 

NT2RP2000903, 

NT2RP2001755, 

NT2RP2002527, 

NT2RP2003042 

NT2RP2003655, 

NT2RP2004524, 

NT2RP2005163, 

NT2RP2005666 

NT2RP2006042 

NT2RP3000304 

NT2RP3000818 

NT2RP3001240 

NT2RP3001650 

NT2RP3002342 

NT2RP3002958 

NT2RP3003614, 

NT2RP3004309 

NT2RP3004647, 

NT2RP4002451 

OVARC1000313 

OVARC1 000956 

OVARC1001725, 

PLACE1000912, 

PLACE1001231 

PLACE1001536, 

PLACE1001836, 

PLACE1 002374 

PLACE1 002967 

PLACE1 003845 

PLACE1 004482 

PLACE1 005003 

PLACE1 005544 

PLACE1 005927, 

PLACE1 006443 

PLACE1 007296 

PLACE1008716 

PLACE1 009527, 

PLACE1010251 

PLACE1011116, 

PLACE3000213, 



HEMBA1 006299, HEMBA1 00631 1 , H EMBA1 006335, 

HEMBA1006707, HEMBA1006724, HEMBA1006749, 

HEMBB1000407, HEMBB1000447, HEMBB1000567, 

HEMBB1001048, HEMBB1001407, HEMBB1001530, 

HEMBB1002041, HEMBB1002162, HEMBB1002245, 

MAMMA1000106, MAMMA1000118, MAMMA1000141, 

MAM MA1 000473, MAMMA1 000496, MAMMA1 000591, 

MAMMA1 000994, MAMMA1001043, MAMMA1001141 , 
MAMMA1001893, MAMMA1001957, MAMMA1001978, 

MAMMA1 002091, MAMMA1 002095, MAMMA1002165, 

MAMMA1 0031 26, NT2RM1 000462, NT2RM1 000542, 

NT2RM1 000899, NT2RM2000241 , NT2RM2000410, 

NT2RM2001 792, NT2RM2001 939, NT2RM2001 941 , 

NT2RM4000444, NT2RM4000587, NT2RM4000593, 

NT2RM4001 325, NT2RM4001 735, NT2RM4001 768, 

NT2RP1 0001 81 , NT2RP1 000261 , NT2RP1 000300, 

NT2RP1000613, NT2RP1 000981 , NT2RP1001004, 



NT2RP200061 6, NT2RP2000649, 
NT2RP2001200, NT2RP2001276, 



NT2RP2001915, 
NT2RP2002533, 
NT2RP2003210, 



NT2RP2003664, NT2RP2003950 
NT2RP2004556, NT2RP2004606 



NT2RP2000663, 
NT2RP2001480, 
NT2RP2002063, 
NT2RP2002824, 
NT2RP2003390, 
NT2RP2004179, 
NT2RP2004648, 
NT2RP2005378, NT2RP2005463, 



NT2RP2001956 : 
NT2RP2002721 : 
NT2RP2003383 



NT2RP2005181 
NT2RP2005883, NT2RP2005994, NT2RP2006004, 

NT2RP2006269, NT2RP2006512, NT2RP2006580, 

NT2RP3000460, NT2RP300061 6, 

NT2RP3000921 , NT2RP3001 01 2, 

NT2RP3001 322, NT2RP3001 388, 

NT2RP3001 858, NT2RP300201 5, 



NT2RP3000436, 
NT2RP3000907, 
NT2RP3001271, 
NT2RP3001738, 
NT2RP3002411, 



NT2RP3003000, NT2RP3003076 
NT2RP3004025, NT2RP3004075 



NT2RP3002737, NT2RP3002790, 
NT2RP3003354, 



NT2RP3004083, 
NT2RP3004481, 
NT2RP4001467, 
NT2RP4002750, OVARC1 000003, OVARC1000105, 
OVARC1 000410, OVARC1 000439, OVARC1 000553, 
OVARC1 001 030, OVARC1 001 1 63, OVARC1 001 336, 
PLACE1 000033 ; 
PLACE1 000927, 
PLACE1 001 340, PLACE1 001 401 , 
PLACE1001564, PLACE1001655, PLACE1001795, 

PLACE1 001 918, PLACE1 001 949, PLACE1 002080, 
PLACE1002518, PLACE1 002547, PLACE1 002726, 
PLACE1 003407, PLACE 1 003573, PLACE1 003737, 
PLACE1 003852, PLACE1 004279, PLACE1 004282, 
PLACE1 004630, PLACE1 004637, 



NT2RP3004345, NT2RP3004406, 
NT2RP4001001, NT2RP4001009, 



OVARC1001991, 
PLACE1000914, 



PLACE1 000231, 
PLACE1001016, 
PLACE1001407, 



PLAC E 1 00503 1 , PLAC E 1 005383, 
PLACE 1 005660, PLACE1 005725, 



PLACE1 004520, 
PLACE1 005005, 
PLACE1 005569, 

PLACE1 006071 , PLACE1 006093, PLACE1 006208, 

PLACE1 00671 6, PLACE1 006809, PLACE1 006959, 

PLACE1 007845, PLACE1 007881 , 

PLACE1 008985, PLACE1 009067, 

PLACE1 009600, PLACE1 009982, 

PLACE1 01 071 3, PLACE1 01 0784, 
PLACE1011236, 



PLACE1 007626, 
PLACE1 008744 
PLACE1 009546 
PLACE1010445 
PLACE1011181, 



HEMBA1006357,
HEMBA1006902,
HEMBB1000679,
HEMBB1001573,
HEMBB1002427,
MAMMA1000204,
MAMMA1000681,
MAMMA1001237,
001237, 



PLACE1011516, 



PLACE4000354, SKNMC1 000004, SKNMC1 00001 4, 



MAMMA1 002234, 
NT2RM1 000580 
NT2RM2000423, 
NT2RM4000198, 
NT2RM4000648, 
NT2RM4001843, 
NT2RP1 000325 
NT2RP1001563 
NT2RP2000694 
NT2RP2001495 
NT2RP2002188 
NT2RP2002942 
NT2RP2003469, 
NT2RP2004205 
NT2RP2004794, 
NT2RP2005541 

NT2RP3000169, 
NT2RP3000676, 
NT2RP3001159, 
NT2RP3001560 
NT2RP3002160 
NT2RP3002836, 
NT2RP3003532 
NT2RP3004130 
NT2RP3004552 
NT2RP4001879 
OVARC1 000298 
OVARC1 000811 
OVARC1001570, 
PLACE1 000560, 
PLACE1001123, 
PLACE1001464, 

PLACE1 002095 
PLACE1 002905 
PLACE1 003772 
PLACE1 004441 
PLACE1 004648 
PLACE1 005410, 
PLACE1 005745, 
PLACE1 006277, 
PLACE1 007081 
PLACE1 008359, 
PLACE1009196, 
PLACE1010011 
PLACE1010827, 
PLACE1011708, 
SKNMC1 000082 



HEMBA1006430, HEMBA1006960, HEMBB1000881, HEMBB1001847, HEMBB1002693,
MAMMA1000226, MAMMA1000810, MAMMA1001344, MAMMA1002586, NT2RM1000855,
NT2RM2000565, NT2RM4000284, NT2RM4000761, NT2RM4002352, NT2RP1000448,
NT2RP2000479, NT2RP2000818, NT2RP2001514, NT2RP2002232, NT2RP2002976,
NT2RP2003593, NT2RP2004495, NT2RP2005027, NT2RP2005597, NT2RP3000171,
NT2RP3000721, NT2RP3001195, NT2RP3001592, NT2RP3002311, NT2RP3002900,
NT2RP3003535, NT2RP3004133, NT2RP3004625, NT2RP4002187, OVARC1000307,
OVARC1000873, OVARC1001607, PLACE1000740, PLACE1001183, PLACE1001516,
PLACE1002355, PLACE1002911, PLACE1003839, PLACE1004450, PLACE1004816,
PLACE1005426, PLACE1005878, PLACE1006290, PLACE1007096, PLACE1008469,
PLACE1009279, PLACE1010078, PLACE1010968, PLACE3000181, THYRO1000036,

THYRO1000099, THYRO1000196, THYRO1000400, THYRO1000584, THYRO1000678,
THYRO1000795, THYRO1000956, THYRO1001102, THYRO1001113,
THYRO1001205, THYRO1001237, THYRO1001242, THYRO1001266, THYRO1001327,
THYRO1001478, THYRO1001523, THYRO1001529, THYRO1001641, THYRO1001702,

Y79AA1 000270, 
Y79AA1001013, 
Y79AA1001430, 
Y79AA1 002022, 



THYR01 000776, 



Y79AA1 000226, 
Y79AA1 000959, 
Y79AA1001427, 
Y79AA1001803, 



Y79AA1 000426, 
Y79AA1001212, 
Y79AA1001727, 
Y79AA1 002058, 



Y79AA1 000521, 
Y79AA1001264, 
Y79AA1001787, 
Y79AA1 002129, 



Y79AA1 000207, 
Y79AA1 000888, 
Y79AA1001426, 
Y79AA1 001799, 
Y79AA1 002373, 

[0251] The following 146 clones were categorized into glycoprotein-associated proteins.

BNGH41000087, BNGH41000091, HEMBA1000349, HEMBA1000590, HEMBA1000745,
HEMBA1001221, HEMBA1001228, HEMBA1001621, HEMBA1002131, HEMBA1002178,
HEMBA1002767, HEMBA1003230, HEMBA1003392, HEMBA1004250, HEMBA1004391,
HEMBA1004505, HEMBA1005449, HEMBA1005522, HEMBA1005545, HEMBA1006707,
HEMBA1006902, HEMBB1000679, HEMBB1000881, HEMBB1001048, HEMBB1002120,
HEMBB1002427, MAMMA1000102, MAMMA1000591, MAMMA1000681, MAMMA1001043,
MAMMA1002070, MAMMA1002586, MAMMA1003126, NT2RM1000462, NT2RM1000580,
NT2RM2001818, NT2RM2001939, NT2RM2001941, NT2RM4000198, NT2RM4000284,
NT2RM4001325, NT2RM4002352, NT2RP1000613,



THYRO1001456,
THYRO1001725,
Y79AA1000876,
Y79AA1001328,
Y79AA1001795,
Y79AA1002213,



NT2RM4000648, NT2RM4000997 

NT2RP1 001 004, NT2RP200061 6, NT2RP2000694, 

NT2RP2002533, NT2RP2003042, NT2RP200321 0, 

NT2RP20051 81 , NT2RP2005541 , NT2RP2005597, 

NT2RP2006269, NT2RP3000304, NT2RP300061 6, 

NT2RP3002737, NT2RP3002958, NT2RP3003000, 

NT2RP3004481 , NT2RP3004552 



NT2RP2000903, NT2RP2001480, 

NT2RP2004205, NT2RP2004606, 

NT2 R P2005883, NT2 R P2006004, 

NT2RP3000921, NT2RP3001650, 

NT2RP3003532, NT2RP30041 30, 

NT2RP4000108, NT2RP4001467, 



NT2RP3004640 

OVARC1 000003, OVARC1 000553, OVARC1 00081 1 , OVARC1 000873, OVARC1001336, 
OVARC1 001 991, PLACE1 000033, PLACE1 000740, PLACE1001016, 
PLACE1 001 1 23, PLACE1 001 231 , PLACE1 001 464, PLACE1 001 655, 

PLACE1 0029 1 1 , PLACE1 003573, 



HEMBA1 000835, 
HEMBA1 002421, 
HEMBA1 004444, 
HEMBA1 006749, 
HEMBB1 002245, 
MAMMA1001237, 
NT2RM2001792, 
NT2RM4000417, 
NT2RP1 000981, 
NT2RP2001755, 
NT2RP2005027, 
NT2RP2006042, 
NT2RP3002160, 
NT2RP3004133, 
NT2RP4002750, 
OVARC1001607, 



PLACE1 002905, 

PLACE1004282, PLACE1004441 



PLACE1 002374, 
PLACE1 003839, 

PLACE1 005003, PLACE1 005426, PLACE1 006071 

PLACE1 007845, PLACE1008716, 

PLACE1010968, PLACE101111 6, 

THYR01 0001 96, THYROI 000584, 



PLACE1001836, 
PLACE 1003737, 
PLACE 1 004450, PLACE1 004520, 
PLACE1 006073, PLACE1 006290, 
PLACE1 008744, PLACE1 008985, PLACE 1 01 0251 , 
PLACE3000181, PLACE3000213, PLACE4000354, 
THYROI 000956, THYROI 001 266, Y79AA1 000270, 



PLACE1002355,
PLACE1003772,
PLACE1004648,
PLACE1007081,
PLACE1010784,
THYRO1000036,
Y79AA1000426,



Y79AA1001727, Y79AA1001795, Y79AA1 002022, Y79AA1002213, 

[0252] The following 55 clones were categorized into signal transduction-associated proteins.

HEMBA1000006, HEMBA1002195, HEMBA1002227, HEMBA1002551, HEMBA1005084,
HEMBA1006658, HEMBA1006916, MAMMA1000881, MAMMA1001150, MAMMA1001310,



NT2RP2001469, NT2RP2001529, 
NT2RP300001 1 , NT2RP3000022, 



NT2RP3003849, NT2RP3003874, 
OVARC1000916, 
PLACE1005519, 
PLACE1011824, 
Y79AA1 000967, 



NT2RM2001902, NT2RP1001020, NT2RP1001031,
NT2RP2003179, NT2RP2003545, NT2RP2004670,
NT2RP3000201, NT2RP3000820, NT2RP3003527,
NT2RP4000634, NT2RP4000962, OVARC1000255, OVARC1000529,
OVARC1001569, PLACE1002329, PLACE1003135, PLACE1003598,
PLACE1008282, PLACE1008297, PLACE1010081, PLACE1011364, PLACE1011824,
THYRO1001593, THYRO1001700, THYRO1001770, Y79AA1000777,
Y79AA1002381,

[0253] The following 80 clones were categorized into transcription-associated proteins.

HEMBA1000462, HEMBA1000671, HEMBA1001297, HEMBA1001390, HEMBA1001886,
HEMBA1003120, HEMBA1003497, HEMBA1004785, HEMBA1005230, HEMBA1005246,
HEMBA1006572, HEMBA1007226, HEMBB1000106, HEMBB1000905, HEMBB1001959,
HEMBB1002661, MAMMA1001094, MAMMA1001532, MAMMA1001615, NT2RM1000789,
NT2RM2000773, NT2RM4000326, NT2RP1000271, NT2RP1000468, NT2RP2000092,

NT2RP2001538, 



NT2RP200071 2, NT2RP2000739, 

NT2RP2002564, NT2RP2002974, NT2RP20031 38, 

NT2RP2004847, NT2RP2005247, NT2RP2005391 , 

NT2RP2006092, NT2RP3000148, 



NT2RP2001662, 
NT2RP2003302, 



NT2RP2001817, 
NT2RP2003940, 



NT2RP2005535, NT2RP2005774, 
NT2RP3000378, NT2RP3000652, 



NT2RP3000232 

NT2RP3004090, NT2RP30041 1 9, NT2RP3004294, OVARC1 001 049, OVARC1 001 086, 
OVARC1001807, PLACE1 000258, PLACE1 000442, PLACE1 000907, PLACE1 003529 



HEMBA1 005929, 

MAMMA1002142, 
NT2RP2001769, 
NT2RP3000172, 
NT2RP3004067, 

OVARC1001338, 
PLACE1 006208, 

THYRO1001457, 
Y79AA1 002376, 



HEMBA1002048,
HEMBA1006276,
HEMBB1002051,
NT2RM2000632,
NT2RP2000610,
NT2RP2001948,
NT2RP2004108,
NT2RP2005941,
NT2RP3001976,
OVARC1001132,
PLACE1004166,
PLACE1004168, PLACE1004887, PLACE1005250, PLACE1005682, PLACE1006079,
PLACE1011407, PLACE1011978, THYRO1000580, Y79AA1000030, Y79AA1001090,
Y79AA1002334, Y79AA1002378,

[0254] The following 85 clones were categorized into disease-associated proteins.



PLACE1 008549, 
Y79AA1 001523, 



BNGH41000020, HEMBA1000349, HEMBA1000590, HEMBA1000671, HEMBA1000835, HEMBA1001184,
HEMBA1001228, HEMBA1001886, HEMBA1003120, HEMBA1004250, HEMBA1005246, HEMBA1005267,
HEMBA1006707, HEMBA1006749, HEMBA1006902, HEMBA1006916, HEMBA1007013, HEMBB1002120,
MAMMA1000204, MAMMA1002080, NT2RM2000632, NT2RM2001126, NT2RM2001558, NT2RP1000271,
NT2RP1000465, NT2RP1000579, NT2RP2000447, NT2RP2000514, NT2RP2000739, NT2RP2001223,
NT2RP2001529, NT2RP2001562, NT2RP2002674, NT2RP2003369, NT2RP2004108, NT2RP2004205,
NT2RP2005535, NT2RP2005941, NT2RP2006004, NT2RP3000059, NT2RP3000125, NT2RP3000201,
NT2RP3000232, NT2RP3000616, NT2RP3000677, NT2RP3000838, NT2RP3000921, NT2RP3001542,
NT2RP3002286, NT2RP3002721, NT2RP3002737, NT2RP3002738, NT2RP3004481, OVARC1000208,
OVARC1000275, OVARC1000331, OVARC1000410, OVARC1001086, OVARC1001132, OVARC1001607,
OVARC1001725, OVARC1001952, PLACE1000258, PLACE1000442, PLACE1000907, PLACE1001100,
PLACE1001500, PLACE1002905, PLACE1002967, PLACE1003407, PLACE1003428, PLACE1005005,
PLACE1005239, PLACE1005815, PLACE1007028, PLACE1008716, PLACE1011407, PLACE1011978,
PLACE2000118, THYRO1000580, THYRO1000866, THYRO1001071, THYRO1001478, Y79AA1001062,
Y79AA1001530,

[0255] Of these, the following 67 clones are those which matched the data of Swiss-Prot database and GenBank
or UniGene database for genes or proteins which had been deposited in the database of Online Mendelian Inheritance
in Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), which is a database of human genes and diseases. (The
corresponding OMIM numbers are indicated after the clone names.)

HEMBA1000349(*600046), HEMBA1000590(*603897), HEMBA1000671(*602277), HEMBA1001886(*603899),
HEMBA1003120(*602277), HEMBA1004250(*600976), HEMBA1005246(*602291), HEMBA1005267(*106410),
HEMBA1006707(*603897), HEMBA1006749(*603897), HEMBA1006902(*603897), HEMBA1006916(*601524),
HEMBA1007013(*603730), HEMBB1002120(*603367), MAMMA1002080(*602672), NT2RM2001126(*603785),
NT2RM2001558(*604689), NT2RP1000271(*603899), NT2RP1000465(*602231), NT2RP2000447(*602580),
NT2RP2000514(*602431), NT2RP2000739(*194558), NT2RP2001223(*603558), NT2RP2001529(*603289),
NT2RP2001562(*603371), NT2RP2002674(*132811), NT2RP2003369(*179555), NT2RP2004108(*601260),
NT2RP2004205(*601610), NT2RP2005535(*603899), NT2RP2006004(*600245), NT2RP3000059(*106410),
NT2RP3000125(*180202), NT2RP3000201(*604666), NT2RP3000232(*602277), NT2RP3000616(*600245),
NT2RP3000677(*142765), NT2RP3000838(*190370), NT2RP3001542(*191161), NT2RP3002286(*604331),
NT2RP3002721(*118950), NT2RP3002738(*602265), NT2RP3004481(*601610), OVARC1000208(*603603),
OVARC1000275(*125647), OVARC1000331(*139265), OVARC1000410(*603874), OVARC1001086(*603862),
OVARC1001725(*603046), OVARC1001952(*190370), PLACE1000258(*603971), PLACE1000442(*601260),
PLACE1000907(*194558), PLACE1001500(*603781), PLACE1002905(*125950), PLACE1003428(*603570),
PLACE1005005(*603124), PLACE1005239(*603365), PLACE1007028(*602131), PLACE1011407(*602277),
PLACE1011978(*603971), PLACE2000118(*301000), THYRO1000580(*602277), THYRO1000866(*604045),
THYRO1001071(*603533), Y79AA1001062(*191161), Y79AA1001530(*602662),
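Entries written as a clone name followed by an OMIM number in parentheses, as in the list above, can be read into a lookup table. The sketch below is illustrative only: the function name and regular expression are assumptions, and it expects entries without internal spaces, such as "HEMBA1000349(*600046)".

```python
import re

# A clone name (letters and digits) followed by "(*<OMIM number>)".
ENTRY_PATTERN = re.compile(r"([A-Z0-9]+)\(\*(\d+)\)")


def parse_omim_entries(text: str) -> dict[str, str]:
    """Map each clone name found in `text` to its OMIM number (kept as a
    string so that the number is reproduced exactly as written)."""
    return {clone: omim for clone, omim in ENTRY_PATTERN.findall(text)}


if __name__ == "__main__":
    sample = "HEMBA1000349(*600046), HEMBA1000590(*603897),"
    for clone, omim in parse_omim_entries(sample).items():
        print(clone, omim)
```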

[0256] Out of the 215 clones excluding the above-mentioned clones, HEMBB1000668 and NT2RM4001377 presumably
belong to a group of signal transduction-associated proteins, based on the results of domain search by Pfam.
[0257] HEMBB1002302 presumably belongs to a group of transcription-associated proteins, based on the results of
domain search by Pfam.

[0258] Of the 437 clones categorized into secretory and/or membrane proteins on the basis of their full-length
sequences, 410 clones were also predicted to encode secretory and/or membrane proteins
on the basis of their partial nucleotide sequences (5' sequences). Of the 146 clones categorized into
glycoprotein-associated proteins on the basis of their full-length sequences, 124 clones were also predicted to encode
glycoprotein-associated proteins on the basis of their partial nucleotide sequences. Of the 57 clones
categorized into signal transduction-associated proteins on the basis of their full-length sequences, 46 clones were
also predicted to encode signal transduction-associated proteins on the basis of their
partial nucleotide sequences. Of the 81 clones categorized into transcription-associated proteins on the basis of their
full-length sequences, 57 clones were also predicted to encode transcription-associated
proteins on the basis of their partial nucleotide sequences. Of the 85 clones categorized into disease-associated proteins
on the basis of their full-length sequences, 6 clones were also predicted to encode disease-associated
proteins on the basis of their partial nucleotide sequences. The number of clones predicted to
encode disease-associated proteins based on the full-length nucleotide sequences is much greater than that predicted
based on the partial sequences. The reason is that the full-length sequences were categorized into the category of
disease-associated proteins by using the data found in the OMIM database.
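The agreement between the two predictions can be expressed as the fraction of full-length assignments that were also obtained from the partial sequences. The following sketch only restates the counts given in the preceding paragraph; the variable and function names are hypothetical.

```python
# (clones categorized from full-length sequences,
#  of which also predicted from partial 5' sequences)
COUNTS = {
    "secretory and/or membrane": (437, 410),
    "glycoprotein-associated": (146, 124),
    "signal transduction-associated": (57, 46),
    "transcription-associated": (81, 57),
    "disease-associated": (85, 6),
}


def agreement(full_length: int, also_partial: int) -> float:
    """Fraction of full-length assignments reproduced by partial sequences."""
    return also_partial / full_length


if __name__ == "__main__":
    for category, (full_length, also_partial) in COUNTS.items():
        ratio = agreement(full_length, also_partial)
        print(f"{category}: {also_partial}/{full_length} = {ratio:.1%}")
```

The disease-associated category stands out with by far the lowest agreement, which the paragraph above attributes to the use of OMIM data for the full-length sequences.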

[0259] When the functions predicted from the partial sequences differed from those predicted from the full-length
sequences, several reasons were presumed: the ORF in the partial sequence is too short compared with
that of the full-length sequence; alternatively, the P value for the partial sequence was greater than that for the
full-length sequence, that is, the probability of the predicted function found in the full-length sequence
was lower for the partial sequence. A protein does not always belong solely to a single one of the above-described
functional categories, and therefore additional functions can be found for the cDNA clones by further
analyses.

[0260] From the analyses of the full-length sequences so far, it is unclear whether or not the remaining 212 clones
encode proteins belonging to any of the categories of secretory and/or membrane proteins, glycoprotein-associated
proteins, signal transduction-associated proteins, transcription-associated proteins or disease-associated proteins.
Nonetheless, the functions which were predicted based on the partial sequences can be verified by further analyses.
[0261] Among the 212 clones, there are 38 clones that presumably belong to the categories of enzymes and/or
metabolism-associated proteins, cell division- and/or cell proliferation-associated proteins, cytoskeleton-associated
proteins, nuclear proteins, DNA- and/or RNA-binding proteins, ATP- and/or GTP-binding proteins, protein synthesis-
and/or protein transport-associated proteins, or cellular defense-associated proteins. Clones with homology search
results from the Swiss-Prot database were categorized by considering the keywords and the items mentioned in the
matching data. Clones with homology search results from the GenBank or UniGene database were categorized
by considering the definitions and the items mentioned in the matching data.
When the matching data contained keywords such as "metabolism", "oxidoreductase" and "E.C. No. (Enzyme
commission number)", the clones were herein defined as clones presumably belonging to the category of enzymes
and/or metabolism-associated proteins. When the matching data contained keywords such as "cell division", "cell cycle",
"mitosis", "chromosomal protein", "cell growth" and "apoptosis", the clones were herein defined as clones presumably
belonging to the category of cell division- or cell proliferation-associated proteins. When the matching data contained
keywords such as "structural protein", "cytoskeleton", "actin-binding" and "microtubules", the clones were herein
defined as clones presumably belonging to the category of cytoskeleton-associated proteins. When the matching data
contained keywords such as "nuclear protein", the clones were herein defined as clones presumably belonging to the
category of nuclear proteins. When the matching data contained keywords such as "DNA-binding" and "RNA-binding",
the clones were herein defined as clones presumably belonging to the category of DNA- or RNA-binding proteins.
When the matching data contained keywords such as "ATP-binding" and "GTP-binding", the clones were herein defined
as clones presumably belonging to the category of ATP- and/or GTP-binding proteins. When the matching data
contained keywords such as "translation regulation", "protein biosynthesis", "amino-acid biosynthesis", "ribosomal protein",
"protein transport" and "signal recognition particle", the clones were herein defined as clones presumably belonging
to the category of protein synthesis- and/or protein transport-associated proteins. When the matching data contained
keywords such as "heat shock", "DNA repair" and "DNA damage", the clones were herein defined as clones presumably
belonging to the category of cellular defense-associated proteins.
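
The keyword rules above amount to a simple lookup from annotation text to category. A minimal sketch in Python follows, assuming a case-insensitive substring match (the text does not say how keywords were matched). The names `CATEGORY_KEYWORDS` and `categorize` are illustrative only and do not come from the original analysis, and the keyword lists are limited to the examples quoted above.

```python
# Keyword examples copied from the paragraph above; the matching rule
# (case-insensitive substring search) is an assumption for illustration.
CATEGORY_KEYWORDS = {
    "enzyme/metabolism": ["metabolism", "oxidoreductase", "E.C."],
    "cell division/proliferation": [
        "cell division", "cell cycle", "mitosis",
        "chromosomal protein", "cell growth", "apoptosis",
    ],
    "cytoskeleton": [
        "structural protein", "cytoskeleton", "actin-binding", "microtubules",
    ],
    "nuclear protein": ["nuclear protein"],
    "DNA/RNA-binding": ["DNA-binding", "RNA-binding"],
    "ATP/GTP-binding": ["ATP-binding", "GTP-binding"],
    "protein synthesis/transport": [
        "translation regulation", "protein biosynthesis",
        "amino-acid biosynthesis", "ribosomal protein",
        "protein transport", "signal recognition particle",
    ],
    "cellular defense": ["heat shock", "DNA repair", "DNA damage"],
}


def categorize(annotation: str) -> list[str]:
    """Return every category whose keywords occur in the annotation text."""
    text = annotation.lower()
    return [
        category
        for category, keywords in CATEGORY_KEYWORDS.items()
        if any(keyword.lower() in text for keyword in keywords)
    ]


# A hypothetical Swiss-Prot style keyword line can match several categories.
print(categorize("ATP-BINDING; NUCLEAR PROTEIN; MITOSIS."))
```

Returning a list rather than a single label reflects the statement in [0259] that a protein need not belong to only one category.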

[0262] The following 10 clones presumably belong to enzymes and/or metabolism-associated proteins.
HEMBA1003315, HEMBB1002465, MAMMA1000614, NT2RP2000178, NT2RP2001388, NT2RP2001903,
NT2RP2002304, NT2RP2005878, NT2RP3001685, PLACE1006219

[0263] The following 4 clones presumably belong to cell division-associated and/or cell proliferation-associated proteins.
MAMMA1000403, NT2RM2000497, NT2RP2000394, Y79AA1002121

[0264] The following 6 clones presumably belong to cytoskeleton-associated proteins.
MAMMA1001609, NT2RM2000589, NT2RP3000063, PLACE1004078, PLACE1004492, PLACE1008657

[0265] The following 7 clones presumably belong to nuclear proteins.
HEMBA1001878, HEMBA1002992, MAMMA1000614, NT2RM4000965, NT2RM2001738, NT2RP2001388,
Y79AA1002121

[0266] The following 5 clones presumably belong to DNA- and/or RNA-binding proteins.
HEMBA1003072, HEMBA1006770, HEMBA1007332, NT2RM2000497, Y79AA1002121

[0267] The following 7 clones presumably belong to ATP- and/or GTP-binding proteins.
HEMBA1002316, MAMMA1001609, NT2RM2000306, NT2RM2000497, NT2RP2000178, NT2RP3003729,
PLACE1004305

[0268] The following 7 clones presumably belong to protein synthesis- and/or protein transport-associated proteins.
NT2RM4000965, NT2RP2005069, NT2RP3000481, NT2RP3000789, NT2RP4001877, OVARC1001833,
OVARC1002058

[0269] The following clone presumably belongs to cellular defense-associated proteins.
PLACE1005539
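
The eight lists in [0262] to [0269] hold 47 entries in total, because some clones are assigned to more than one category. The following sketch (Python; the dictionary name and category labels are illustrative, and clone names are written with the OCR spacing removed) checks that the lists reduce to the 38 distinct clones stated in [0261] and reports which clones are shared between categories.

```python
from collections import Counter

# Clone names as listed in [0262]-[0269], with stray spaces removed.
CATEGORY_CLONES = {
    "enzyme/metabolism": [
        "HEMBA1003315", "HEMBB1002465", "MAMMA1000614", "NT2RP2000178",
        "NT2RP2001388", "NT2RP2001903", "NT2RP2002304", "NT2RP2005878",
        "NT2RP3001685", "PLACE1006219",
    ],
    "cell division/proliferation": [
        "MAMMA1000403", "NT2RM2000497", "NT2RP2000394", "Y79AA1002121",
    ],
    "cytoskeleton": [
        "MAMMA1001609", "NT2RM2000589", "NT2RP3000063", "PLACE1004078",
        "PLACE1004492", "PLACE1008657",
    ],
    "nuclear protein": [
        "HEMBA1001878", "HEMBA1002992", "MAMMA1000614", "NT2RM4000965",
        "NT2RM2001738", "NT2RP2001388", "Y79AA1002121",
    ],
    "DNA/RNA-binding": [
        "HEMBA1003072", "HEMBA1006770", "HEMBA1007332", "NT2RM2000497",
        "Y79AA1002121",
    ],
    "ATP/GTP-binding": [
        "HEMBA1002316", "MAMMA1001609", "NT2RM2000306", "NT2RM2000497",
        "NT2RP2000178", "NT2RP3003729", "PLACE1004305",
    ],
    "protein synthesis/transport": [
        "NT2RM4000965", "NT2RP2005069", "NT2RP3000481", "NT2RP3000789",
        "NT2RP4001877", "OVARC1001833", "OVARC1002058",
    ],
    "cellular defense": ["PLACE1005539"],
}

# Flatten all assignments and count how often each clone occurs.
assignments = [c for clones in CATEGORY_CLONES.values() for c in clones]
counts = Counter(assignments)
shared = {clone: n for clone, n in counts.items() if n > 1}

print(len(assignments), "assignments,", len(counts), "distinct clones")
print("clones in more than one category:", sorted(shared))
```

With 38 of the 212 clones categorized this way, 174 remain, which is the figure used in the next paragraph.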

[0270] Although it is unclear whether 26 of the 174 clones other than the above-mentioned clones belong to
any of the above-described categories, these clones are predicted to have some function on the basis of the homology
search using their full-length sequences. The clone names and the gene definitions found in the homology search
results are shown below, separated by a double slash (//).

HEMBA1000634//Homo sapiens T-cell activation protein (PGR1) gene, complete cds.
HEMBA1002524//Human MHC Class I region proline rich protein mRNA, complete cds.
HEMBA1003399//MVP1 PROTEIN.
HEMBA1005489//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete cds.
HEMBB1000542//Mus musculus bromodomain-containing protein BP75 mRNA, complete cds.
MAMMA1000788//Bos taurus P14 (p14) mRNA, complete cds.
MAMMA1002128//ABC1 PROTEIN HOMOLOG PRECURSOR.
NT2RM2000514//Homo sapiens F-box protein Fbx21 (FBX21) mRNA, complete cds.
NT2RM2000622//Mus musculus F-box protein FBL10 mRNA, partial cds.
NT2RM4000100//Homo sapiens Leman coiled-coil protein (LCCP) mRNA, complete cds.
NT2RP2005425//Homo sapiens mRNA for AKAP450 protein.
NT2RP3001170//Mus musculus activity-dependent neuroprotective protein (Adnp) mRNA, complete cds.
NT2RP3002571//Bos taurus mRNA for lyncein.
NT2RP3004557//Human Ki nuclear autoantigen mRNA, complete cds.
OVARC1001596//Homo sapiens Arf-like 2 binding protein BART1 mRNA, complete cds.
PLACE1002153//Homo sapiens TACC2 protein (TACC2) mRNA, partial cds.
PLACE1003163//Homo sapiens DBI-related protein mRNA, complete cds.
PLACE1005736//Human mRNA for BAS-GRIP protein.
PLACE1007702//Mus musculus TRA1 mRNA, complete cds.
PLACE1011045//Homo sapiens E1-like protein mRNA, complete cds.
THYRO1000061//Mus musculus mRNA for UBE-1c1, UBE-1c2, UBE-1c3, complete cds.
THYRO1000964//Drosophila melanogaster Pelle associated protein Pellino (Pli) mRNA, complete cds.
Y79AA1000776//Mus musculus mRNA for GSG1, complete cds.
Y79AA1001056//Homo sapiens MAID protein mRNA, complete cds.
Y79AA1001272//Homo sapiens retinoic acid repressible protein (RARG-1) mRNA, complete cds.
Y79AA1001793//Mus musculus mRNA for GSG1, complete cds.



[0271] So far, no information useful for presuming function is available for the remaining 148 clones, whose
names are listed below.

HEMBA1000275, HEMBA1000300, HEMBA1000443, HEMBA1000875, HEMBA1000907, HEMBA1001272,
HEMBA1001296, HEMBA1001563, HEMBA1002164, HEMBA1002239, HEMBA1002985, HEMBA1003294,
HEMBA1003487, HEMBA1004007, HEMBA1004067, HEMBA1004085, HEMBA1004952, HEMBA1004971,
HEMBA1005145, HEMBA1005430, HEMBA1005913, HEMBA1006016, HEMBA1006517, HEMBA1006544,
HEMBA1006912, HEMBA1007057, HEMBA1007063, HEMBA1007291, HEMBB1000276, HEMBB1000309,
HEMBB1000642, HEMBB1001200, HEMBB1001547, HEMBB1002039, HEMBB1002228, HEMBB1002663,
MAMMA1000046, MAMMA1000449, MAMMA1000528, MAMMA1000652, MAMMA1000706, MAMMA1000814,
MAMMA1001066, MAMMA1001284, MAMMA1001623, MAMMA1001634, MAMMA1001901, MAMMA1002087,
MAMMA1002205, MAMMA1002224, NT2RM2000582, NT2RM2001643, NT2RM4000115, NT2RM4000295,
NT2RM4001321, NT2RP1000002, NT2RP1000239, NT2RP1000679, NT2RP1000740, NT2RP1000903,
NT2RP2000240, NT2RP2001878, NT2RP2001921, NT2RP2002015, NT2RP2002409, NT2RP2002510,
NT2RP2003599, NT2RP2003931, NT2RP2004069, NT2RP2004141, NT2RP2004447, NT2RP2004837,
NT2RP2005514, NT2RP2005632, NT2RP2005887, NT2RP2006099, NT2RP2006134, NT2RP3000427,
NT2RP3000444, NT2RP3000645, NT2RP3000871, NT2RP3001044, NT2RP3001061, NT2RP3001754,
NT2RP3002281, NT2RP3002324, NT2RP3002353, NT2RP3002409, NT2RP3002448, NT2RP3002664,
NT2RP3002887, NT2RP3002983, NT2RP3003448, NT2RP3003469, NT2RP3003473, NT2RP3003559,
NT2RP3003963, NT2RP3004000, NT2RP3004202, NT2RP3004321,
NT2RP3004355, NT2RP3004374, NT2RP4002715, OVARC1000090, OVARC1000137, OVARC1000467,
OVARC1000775, OVARC1000853, OVARC1000995, OVARC1001222, OVARC1001260, OVARC1001727,
OVARC1002178, PLACE1000986, PLACE1001114, PLACE1001229, PLACE1001788, PLACE1003438,
PLACE1003460, PLACE1003644, PLACE1004028, PLACE1004199, PLACE1004519, PLACE1005601,
PLACE1005669, PLACE1005768, PLACE1006515, PLACE1006786, PLACE1007040, PLACE1007077,
PLACE1007591, PLACE1007971, PLACE1008984, PLACE1009735, PLACE2000219, PLACE4000455,
THYRO1000846, THYRO1000999, THYRO1001063, THYRO1001128, THYRO1001471, THYRO1001495,
THYRO1001608, THYRO1001803, Y79AA1000127, Y79AA1000750, Y79AA1001592, Y79AA1001863

EXAMPLE 18

Expression frequency analysis using PCR

[0272] Many genes acting downstream of TNF-α and IL-1β, which are inflammation-associated cytokines, have
been previously identified. The respective stimuli are transduced through independent signaling cascades.
There is also another signaling cascade for both stimuli, in which NF-κB is a transducing molecule shared
by the two (J. Leukoc. Biol., 1994, 56(5): 542-547). It has also been revealed that the expression levels of many
inflammation-associated genes, including IL-2, IL-6 and G-CSF, vary in response to the signal through the
common pathway (Trends Genet. 1999, 15(6): 229-235). A survey was performed by using the ATAC-PCR
method (adaptor-tagged competitive PCR method: Nucleic Acids Res. 1997, Nov 15; 25(22): 4694-6) for genes
whose expression levels varied on stimulation with the inflammatory cytokines TNF-α and IL-1β. It is possible
that genes whose expression varies in response to this stimulation also participate in inflammation.

[0273] Jurkat cells (Dainippon Pharmaceutical Co., Ltd.: catalog No. 06-152) were cultured in RPMI1640 medium
(Nikken Biological and Medical Institute: catalog No. 14-501F) containing 10% fetal calf serum until the cell count
reached 10^7 cells. The cells were transferred into fresh medium containing 10 ng/ml TNF-α (recombinant Tumor
Necrosis Factor; Wako Pure Chemical Industries Inc.: catalog No. 201-13461) or IL-1β (recombinant Interleukin-1β;
PeprotechEC: catalog No. 200-01B) and further cultured at 37°C under an atmosphere of 5% CO2. The cells cultured
in the presence of TNF-α were harvested 1, 3 and 7 hours after addition of TNF-α. The cells cultured in the presence
of IL-1β were harvested 1 and 7 hours after addition of IL-1β. Total RNA was extracted from each of the cells by the
AGPC method (Acid-Guanidinium-Phenol-Chloroform method: Anal. Biochem. 1987, Apr; 162(1): 156-9). Total RNA
was also extracted from cells that had not been stimulated with TNF-α or IL-1β.

[0274] ATAC-PCR analysis was performed basically according to the procedure described in "DNA Microarray
and Advanced PCR Methods" (Cell Engineering, p. 104-112, (additional volume, Genome Science Series 1),
Muramatsu & Naba (eds.), Shujunnsya). The adaptor ligation reaction was performed for an internal standard sample
(which was used for preparing a calibration curve for the assessment of the test samples) and for the test samples
in the following two independent reaction systems. The combinations of each of the 6 adaptors (AD-1, AD-2, AD-3,
AD-4, AD-5, and AD-6: see the sequences shown below) with each sample are as follows:

Reaction system A

AD1: internal standard sample (x10 concentration)
AD2: sample before stimulation
AD3: internal standard sample (x3 concentration)
AD4: sample with IL-1 stimulation for 1 hour
AD5: sample with IL-1 stimulation for 7 hours
AD6: internal standard sample (x1 concentration)

Reaction system B

AD1: internal standard sample (x1 concentration)
AD2: sample with TNF stimulation for 1 hour
AD3: sample with TNF stimulation for 3 hours
AD4: internal standard sample (x3 concentration)
AD5: sample with TNF stimulation for 7 hours
AD6: internal standard sample (x10 concentration)

Adaptor sequence
AD1:
SEQ ID NO:4180//5'-GTACATATTGTCGTTAGAACGCG-3'
SEQ ID NO:4181//3'-CATGTATAACAGCAATCTTGCGCCTAG-5'
AD2:
SEQ ID NO:4182//5'-GTACATATTGTCGTTAGAACGCGACT-3'
SEQ ID NO:4183//3'-CATGTATAACAGCAATCTTGCGCTGACTAG-5'
AD3:
SEQ ID NO:4184//5'-GTACATATTGTCGTTAGAACGCGCATACT-3'
SEQ ID NO:4185//3'-CATGTATAACAGCAATCTTGCGCGTATGACTAG-5'
AD4:
SEQ ID NO:4186//5'-GTACATATTGTCGTTAGAACGCGATCCATACT-3'
SEQ ID NO:4187//3'-CATGTATAACAGCAATCTTGCGCTAGGTATGACTAG-5'
AD5:
SEQ ID NO:4188//5'-GTACATATTGTCGTTAGAACGCGTCAATCCATACT-3'
SEQ ID NO:4189//3'-CATGTATAACAGCAATCTTGCGCAGTTAGGTATGACTAG-5'
AD6:
SEQ ID NO:4190//5'-GTACATATTGTCGTTAGAACGCGTACTCAATCCATACT-3'
SEQ ID NO:4191//3'-CATGTATAACAGCAATCTTGCGCATGAGTTAGGTATGACTAG-5'
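
Each adaptor above is given as an upper strand (5' to 3') and a lower strand (3' to 5'). The sketch below (Python; `ADAPTORS` and `check_adaptor` are illustrative names) checks that the two strands of each adaptor are complementary position by position and reports the unpaired bases left on the lower strand. The sequences are typed with the OCR spacing removed, and the AD3 lower strand is assumed to read ...GTATGA..., as base pairing with its upper strand requires.

```python
# Upper strand written 5'->3', lower strand written 3'->5', as in the listing.
# Assumption: AD3 lower strand contains GTATGA (required for base pairing).
ADAPTORS = {
    "AD1": ("GTACATATTGTCGTTAGAACGCG",
            "CATGTATAACAGCAATCTTGCGCCTAG"),
    "AD2": ("GTACATATTGTCGTTAGAACGCGACT",
            "CATGTATAACAGCAATCTTGCGCTGACTAG"),
    "AD3": ("GTACATATTGTCGTTAGAACGCGCATACT",
            "CATGTATAACAGCAATCTTGCGCGTATGACTAG"),
    "AD4": ("GTACATATTGTCGTTAGAACGCGATCCATACT",
            "CATGTATAACAGCAATCTTGCGCTAGGTATGACTAG"),
    "AD5": ("GTACATATTGTCGTTAGAACGCGTCAATCCATACT",
            "CATGTATAACAGCAATCTTGCGCAGTTAGGTATGACTAG"),
    "AD6": ("GTACATATTGTCGTTAGAACGCGTACTCAATCCATACT",
            "CATGTATAACAGCAATCTTGCGCATGAGTTAGGTATGACTAG"),
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def check_adaptor(upper, lower):
    """Return (strands_are_complementary, unpaired_lower_strand_bases)."""
    paired = lower[:len(upper)]       # bases facing the upper strand
    overhang = lower[len(upper):]     # bases extending past the upper strand
    return upper.translate(COMPLEMENT) == paired, overhang


for name, (upper, lower) in ADAPTORS.items():
    complementary, overhang = check_adaptor(upper, lower)
    print(name, len(upper), complementary, overhang)
```

Under these readings the upper strands grow by three bases from one adaptor to the next, so products carrying different adaptors differ in length and can be told apart after a single competitive PCR.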



[0275] In this assay, the internal standard samples used were total RNA from cultured cells or human tissues from 
which the cDNA libraries originated. The cultured cells and the total RNAs from tissues are indicated below. Culture 
of the cells was performed according to the method as described in the supplier's instruction manual. RNA preparation 
was carried out by standard methods. 

Human teratocarcinoma cell NT-2 (Stratagene, catalog No. 204101) 

Human neuroblastoma cell SK-N-MC (Dainippon Pharmaceutical Co., Ltd., catalog No. 04-010) 
Human neuroblastoma cell Y79 (Dainippon Pharmaceutical Co., Ltd., catalog No. 04-018) 
Human placenta tissues total RNA (BioChin, catalog No. 064008) 
Human breast tissue total RNA (Clontech, catalog No. 64037-1) 



[0276] PCR primers used for amplification of specific genes, and the names of the corresponding cDNA clones, are
shown below. The assay was not carried out for clones for which a corresponding internal standard sample could
not be prepared. The gene-specific primers were designed so that the PCR products derived from the adaptor-ligated
cDNAs were 70-200 bp in size. The sequence of the adaptor-specific primer (labeled with the fluorescent dye FAM)
used for the competitive PCR was GTACATATTGTCGTTAGAACGC (22 nucleotides, SEQ ID NO: 4192). PCR was
basically performed at 94°C for 5 minutes, followed by 30 cycles of 94°C for 30 seconds, 50°C for 60 seconds and
72°C for 60 seconds. The annealing temperature was, however, changed in some PCR experiments.
[0277] Nucleotide sequences of the clone-specific primers used in this experiment (all the primers consist of 20 nucleotides)
[0278] Clone names, primer sequences, and SEQ ID NOs are shown in this order, separated by a double slash (//).
Y79AA1000967//GGCACAGACACCATCCTTGA//SEQ ID NO:4461
Y79AA1001056//ACAAATGAGCCTGAAAAGTC//SEQ ID NO:4462
Y79AA1001062//TGGTCCTCACTGCCTTCAAA//SEQ ID NO:4463
Y79AA1001090//AGTGCCCTCAAAGCTCCAGT//SEQ ID NO:4464
Y79AA1001212//ACGAAAGCACTCAAATGTCA//SEQ ID NO:4465
Y79AA1001272//GAATGAAATGTGGTTGAGCA//SEQ ID NO:4466
Y79AA1001426//AATGATTCGGGGCAGCAGGA//SEQ ID NO:4467
Y79AA1001427//GAGAGAGACACACACAGAAA//SEQ ID NO:4468
Y79AA1001523//AGTTTTATACCAGCATTGGC//SEQ ID NO:4469
Y79AA1001530//GGTGTAGAAGTAAAATGGGA//SEQ ID NO:4470
Y79AA1001592//GATTGTGTTCTCTTACTCCT//SEQ ID NO:4471
Y79AA1001727//GCTCCACCTGACGTTCTTTA//SEQ ID NO:4472
Y79AA1001795//GTCTCCCATATCGCTGTCTT//SEQ ID NO:4473
Y79AA1001803//CACTTTCTAATAACCCTTGG//SEQ ID NO:4474
Y79AA1001863//TTGGGATTGGAAACCGCATT//SEQ ID NO:4475
Y79AA1001874//AGAAACCACTGAGGCCCAAG//SEQ ID NO:4476
Y79AA1002058//CAGAAGCAGAAGCAGGAGCA//SEQ ID NO:4477
Y79AA1002121//AnTACTGCGATnCTCCTG//SEQ ID NO:4478
Y79AA1002129//GAGTTTCTTTGCTAGTTCCA//SEQ ID NO:4479
Y79AA1002334//ATATTTGTGTTTGCCTTGGG//SEQ ID NO:4480
Y79AA1002373//GGATGGCTGGGTCAAATGGT//SEQ ID NO:4481
Y79AA1002376//AATGATGGCTAGGGTGACTT//SEQ ID NO:4482
Y79AA1002378//TCTTCCCACATTCGTTACAC//SEQ ID NO:4483
Y79AA1002381//AGGGAGTAGATGTTGGTAAA//SEQ ID NO:4484
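
Since every clone-specific primer is stated to be 20 nucleotides long, the records above can be checked mechanically. The following is a minimal sketch in Python, assuming one record per line in the form clone//primer//SEQ ID NO:n. The function names are illustrative, and the check only reports whether a primer is 20 bases of A, C, G or T; it does not attempt to repair an illegible entry.

```python
import re

# One record per line: clone name // primer sequence // SEQ ID NO:<number>
RECORD = re.compile(r"^\s*(\S+)\s*//\s*([A-Za-z]+)\s*//\s*SEQ ID NO:\s*(\d+)\s*$")


def parse_primer_record(line):
    """Split a double-slash record into (clone, primer, seq_id_number)."""
    match = RECORD.match(line)
    if match is None:
        raise ValueError(f"not a primer record: {line!r}")
    clone, primer, seq_id = match.groups()
    return clone, primer, int(seq_id)


def is_valid_primer(primer, length=20):
    """True if the primer has the stated length and only unambiguous bases."""
    return len(primer) == length and set(primer) <= set("ACGT")


records = [
    "Y79AA1000967//GGCACAGACACCATCCTTGA//SEQ ID NO:4461",
    "Y79AA1002121//AnTACTGCGATnCTCCTG//SEQ ID NO:4478",
]
for line in records:
    clone, primer, seq_id = parse_primer_record(line)
    print(clone, seq_id, is_valid_primer(primer))
```

A record that fails the check, such as the second one as it is legible here, would need to be compared against the sequence listing for the corresponding SEQ ID NO.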



[0279] The result of the expression frequency analysis is shown in Table 367. Only clones with a correlation coefficient
of 0.9 or higher are indicated in this Table. Clones that are not presented in the Table include clones for which the assay
could not be performed because of their low expression levels in the internal standard samples or because of
unexpectedly smaller or larger sizes of the PCR products.
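
The role of the three internal standards (x1, x3 and x10) and of the 0.9 cutoff can be illustrated as follows. This is a minimal sketch in Python under stated assumptions: the calibration curve is taken to be a straight line fitted by least squares through the three internal-standard signals, and the correlation coefficient is taken to be Pearson's r of that fit. The text does not spell out either point, and the function names and peak areas below are hypothetical.

```python
from statistics import mean


def linear_fit(xs, ys):
    """Least-squares slope and intercept, plus Pearson r, for paired data."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    r = sxy / (sxx * syy) ** 0.5
    return slope, intercept, r


def quantify(standard_areas, sample_areas, amounts=(1, 3, 10), min_r=0.9):
    """Convert sample peak areas to relative amounts via the standards.

    Returns None when the calibration fit falls below the r cutoff.
    """
    slope, intercept, r = linear_fit(amounts, standard_areas)
    if r < min_r:
        return None
    return [(area - intercept) / slope for area in sample_areas]


# Hypothetical peak areas for the x1, x3 and x10 standards and two samples.
print(quantify([100.0, 300.0, 1000.0], [200.0, 500.0]))
```

Because the standards and the test samples are amplified in the same tube with the same primer pair, a poor fit among the standards is a reasonable signal that the measurement for that clone should be set aside.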

[0280] Among the clones that could be analyzed, the clones whose expression levels increased two-fold in response
to IL-1β stimulation, 1 or 7 hours after the stimulation, are: NT2RM2000514, NT2RP3001159, MAMMA1001237 and
MAMMA1000614.

[0281] The clones whose expression levels increased two-fold in response to TNF-α stimulation, 1, 3 or 7 hours
after the stimulation, are:

NT2RM2000582, NT2RM2002109, NT2RP1000679, NT2RP2003664, NT2RP2005597, NT2RP2004108,
NT2RP3001592, NT2RP3002738, NT2RP3004133, NT2RP3004321, NT2RP3004557, NT2RP3004294,
MAMMA1001237, MAMMA1000141, MAMMA1000788, MAMMA1002070, PLACE1002547, PLACE1003573,
PLACE1004305, PLACE1008744, PLACE1011181, PLACE1010713, PLACE1010011, Y79AA1000776,
Y79AA1002129.

[0282] Among the clones whose expression levels increased in response to IL-1β stimulation, MAMMA1001237
was a clone whose expression level also varied in response to TNF-α stimulation. Among the clones showing higher
expression levels (with a relative value of 5 or higher) prior to the stimulation, PLACE1002080 is an example of a clone
whose expression was suppressed by the stimulation. The expression of this clone decreased three-fold or more
in response to the stimulation. These genes were found to be associated with the inflammatory reaction induced by
IL-1β or TNF-α.
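
The selection criteria used in [0280] to [0282] can be written as a small classifier. The sketch below (Python) assumes that "two-fold" means a ratio of at least 2 relative to the unstimulated sample at any harvested time point, and that a three-fold or greater decrease means a ratio of at most 1/3. The function name, the labels and the example values are hypothetical and are not taken from Table 367.

```python
def classify_response(before, after_values, up=2.0, down=3.0):
    """Label a clone by its strongest change relative to the unstimulated level.

    before        relative expression before stimulation
    after_values  relative expression at each harvested time point
    """
    ratios = [value / before for value in after_values]
    if max(ratios) >= up:
        return "induced"
    if min(ratios) <= 1.0 / down:
        return "suppressed"
    return "unchanged"


# Hypothetical relative expression values (before; 1 h, 3 h and 7 h after).
print(classify_response(1.0, [1.2, 2.5, 1.8]))   # induced
print(classify_response(6.0, [1.5, 1.8, 2.5]))   # suppressed
print(classify_response(1.0, [0.9, 1.1, 1.3]))   # unchanged
```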

[0283] In Example 15, the genes whose expression levels varied on culturing in the presence of TNF-α were
analyzed by hybridization with a high-density DNA filter. For 3 clones (NT2RP3004557, NT2RP3004294 and
PLACE1002547), the results obtained by the ATAC-PCR method were similar to those obtained by the hybridization
method. However, the results obtained by the ATAC-PCR method were not necessarily consistent with those obtained
by the hybridization method. Possible reasons for the inconsistency are the difference in the cells used in the two
experiments, the unavailability of some data in the ATAC-PCR experiment, and the difference in the method of data
treatment.

Table 28

Expression of each cDNA in human tissues (The Table also contains clones with no description in Examples)

Clone name  Heart  Lung  P. gland  Thymus  Brain  Kidney  Liver  Spleen
GAPDH(Cr1)  38.210  32.670  23.820  13.580  11.230  21.120  24.910  22.440
βactin(Cr2)  279.280  368.870  111.100  17.500  92.880  14.650  82.990  256.790
ADRGL1000005  53.882  23.005  32.749  22.858  26.564  24.940  22.644  27.001
ADRGL1000007  94.778  85.185  160.457  67.191  101.768  62.489  67.150  73.543
ADRGL1000009  11.141  50.520  10.357  7.177  6.013  5.219  14.272  21.225
ADRGL1000011  71.656  24.579  29.358  19.473  24.898  30.747  49.220  22.221
ADRGL1000027  36.238  25.252  20.855  7.328  11.196  14.298  19.658  11.288
ADRGL1000058  66.209  129.497  55.226  49.241  30.219  55.872  67.027  243.436
ADRGL1000069  38.630  23.459  28.991  12.540  27.353  33.633  28.774  20.911
ADRGL1000077  97.465  63.656  448.427  83.412  71.108  53.740  67.906  89.439
ADRGL1000092  89.423  45.692  55.810  26.033  44.148  73.339  96.037  73.091
ADRGL1000099  73.675  24.424  36.128  17.024  25.964  41.391  42.837  29.666
ADRGL1000136  141.745  63.974  77.017  24.777  33.549  58.986  295.009  84.985
ADRGL1000147  394.563  155.829  271.210  92.899  165.627  251.266  253.420  150.294
ADRGL1000159  50.073  25.425  39.296  15.194  16.125  20.040  33.720  23.278
ADRGL1000160  69.386  31.051  59.416  20.154  39.799  27.027  47.169  20.716
ADRGL1000171  57.047  23.011  43.063  23.860  40.581  59.814  117.055  32.630
ADRGL1000181  45.892  18.666  34.476  15.434  34.225  32.962  39.693  16.334
BGGI11000015  153.242  42.337  92.865  41.003  45.168  88.524  85.990  73.392
BGGI11000016  177.367  94.731  119.688  34.159  30.249  98.806  98.783  39.204
BGGI11000017  84.712  32.614  38.131  20.878  18.769  32.340  39.666  20.750
BGGI11000022  52.468  20.452  67.167  12.167  11.158  18.241  19.197  11.937
BGGI11000031  30.008  17.072  40.883  12.585  13.313  15.525  16.757  13.406
BGGI11000042  49.926  36.336  51.176  26.964  43.122  43.770  49.107  38.776
BGGI11000046  31.618  26.472  34.182  31.854  12.650  25.784  18.430  25.385
BNGH41000020  5031.103  2993.496  1444.841  537.162  £973.542  5029.124  3350.527  3649.144
BNGH41000025  91.717  35.026  73.901  27.713  30.765  36.523  37.596  47.074
BNGH41000026  176.757  77.439  98.345  35.807  56.991  91.310  75.797  70.241
BNGH41000027  65.029  56.353  25.896  22.494  12.763  23.748  17.836  23.859
BNGH41000035  148.779  66.776  119.727  56.576  60.996  96.959  72.461  64.458
BNGH41000037  79.500  29.611  43.438  18.317  20.857  36.272  27.525  24.771
BNGH41000042  224.484  110.084  168.448  104.351  102.259  125.323  86.783  122.959
BNGH41000048  56.144  32.253  54.063  14.729  27.312  22.435  29.566  28.937
BNGH41000056  67.258  18.694  30.075  15.602  10.072  20.735  16.100  7.642
BNGH41000087  98.262  46.173  77.657  35.329  40.900  50.029  50.841  45.285
BNGH41000091  50.895  16.985  28.392  10.147  5.469  22.794  10,7*5  12.410
BNGH41000157  69.043  34.730  40.597  18.088  27.072  22.074  25.410  24.950
BNGH41000169  44.850  21.770  28.655  11.403  25.991  28.509  25.634  25.843
BNGH41000181  17.163  15.689  13.948  3.996  9.287  13.139  15.553  16.575
BNGH41000198  81.510  36.250  60.860  20.585  26.929  35.751  31.695  28.325
BNGH41000219  30.302  25.156  22.187  13.757  11.208  15.235  27.285  35.709
BNGH41000229  252.790  65.948  93.499  51.103  92.555  101.245  96.716  78.266
BNGH41000237  85.757  46.997  55.170  26.780  33.764  47.456  37.007  39.131
BNGH41000238  17.744  36.938  42.360  14.922  35.749  42.848  39.238  13.241
BNGH41000243  45.446  23.667  44.798  20.875  10.516  23.918  22.443  27.033
BNGH41000270  60.889  18.651  29.618  10.724  15.979  12.351  19.152  22.314
BRAWH1000004  43.673  28.539  7.640  11.388  19.198  14.903  32.353  23.777
BRAWH1000018  59.409  17.941  102.270  17.107  709.078  25.732  24.214  24.767
BRAWH1000021  104.772  29.951  51.142  21.042  1169.154  55.762  66.754  27.969
BRAWH1000027  152.205  47.310  67.089  32.199  64.521  70.731  79.670  40.928
BRAWH1000029  106.376  49.221  55.840  40.856  59.552  56.487  64.886  100.132
BRAWH1000040  29.419  16.761  31.101  16.622  30.633  18.200  17.998  15.196
BRAWH1000050  161.264  71.786  118.976  51.863  61.542  97.720  81.271  69.194
BRAWH1000051  74.067  34.341  44.047  20.726  30.434  42.055  53.856  24.624
BRAWH1000060  68.789  22.598  35.012  16.493  19.127  38.662  34.923  28.094
BRAWH1000075  17.318  16.898  36.437  8.901  18.133  17.219  9.321  11.200
BRAWH1000081  43.025  12.998  28.267  7.655  123.677  17.673  15.924  9.844
BRAWH1000084  174.384  42.178  80.534  47.752  152.188  77.111  110.187  102.296
BRAWH1000095  118.239  59.676  64.$28  28.174  116.975  53.814  746.700  35.985
BRAWH1000096  146.112  44.967  85.882  27.491  145.013  52.880  52.427  58.678
BRAWH1000097  95.841  72.506  174.954  65.637  64.200  73.707  63.827  63.762
BRAWH1000100  11.943  19.037  18.950  13.536  92.145  16.582  16.646  10.218
BRAWH1000101  134.838  57.232  106.632  40.741  96.396  71.642  88.432  57.336
Table 58



CMDD 1 UUU 0*0 


55. 058 


47. 453 


95.751 


42. 262 


25. 684 


15. 056 


22. 121 


28. 829 


CJKDD IUUUO*t£ 


79. 188 


88. 317 


251.754 


80 865 


42. 468 


81. 296 


37. 696 


52. 009 


CHDD 1 UU UO ** J 


43 4] | 


25. 689 


1 13. 037 


8. 985 


11. 038 


14. 245 


8. 276 


18. 743 


jciianinnnR4Q 

UDD tUUUDfS 


77 H57 


CMC. 


137 371 


'Id 816 

0*t. Old 


24 49K 

£4. *tjQ 


9 Q.R7 


1 1. 881 


"22. 322 


CJiDOIUUUDOc 


84 Q47 

OS. 3HC 


R1 RRR 
D 1 . O30 


17R 167 


78. Ill 
1 0. 1 o 1 


47 040 


1R 343 


77 857 

CC. OJt 


31. 597 


ucudjii nnnccc 


jift ma 

4 1 O. 5UO 


1 J. ill 


30. 030 


t7 1£R 


37 733 


R7 A94 


1A AQ7 

gO. 03I 


44 477 




1C 9C1 
lO. £33 


1 J. 994 


tft 7CR 
IU. I DO 


9rt 111 7 
2U. 0 1 r 


C 7QC 
O. f HO 


11 1 1 n 

1 J. 1 1 U 


7 QR7 

1 . 901 


4 4Rft 

4. 130 




£0. 3qi 


1 J. 4 3 3 


ii enc 
1 4. DUO 


1 1 7QB 


OC HA A 
CO. 044 


1 c n>iQ 

1 3. U49 


IC. D49 


i 1 9fl7 
1 1 . £U£ 


EMBB 1000671 


oon non 
239. UcU 


04 AC) ' 


3D 1 . 221 


Q 070. 


OC OAA 

oo. £44 


f 3. Ubo 


CC ftl9 
OO. Of £ 


OO OC7 

00. £Or 


tMBBI O0U673 


11 coo 

1 1 . 633 


3. * r9 


1 4 COO 

14. oZ9 


14. 9U4 


C QIC 
O. 910 


A Oft 

4. oil 


9 1/11 
£. 141 


19 D 1 9 


HEMBB1 000679 


16. 899 


7. 357 


23. 438 


7. 697 


1. 049 


30. 246 


7. / f4 


7 nc 0 
1 . 0o3 


HEMBB1000684  88.240  157.754  430.254  128.150  66.411  89.722  49.173  67.832
HEMBB1000692  4.978  9.265  11.569  5.085  1.158  3.240  3.421  1.785
HEMBB1000693  63.119  40.561  59.522  22.326  25.408  13.898  31.488  20.706
HEMBB1000705  15.560  31.798  122.757  36.451  19.928  11.568  2.839  10.179
HEMBB1000706  22.553  13.626  23.777  8.621  11.683  41.509  10.019  7.584
HEMBB1000709  74.737  77.864  245.726  50.833  51.093  50.427  37.955  51.357
HEMBB1000714  23.728  10.733  6.625  12.298  6.349  9.891  2.142  14.350
HEMBB1000725  24.239  9.575  11.437  13.761  12.596  17.372  8.105  16.144
HEMBB1000726  86.971  84.395  208.396  65.157  43.881  37.441  22.020  39.067
HEMBB1000729  51.556  25.288  140.931  23.005  27.775  18.629  12.838  14.902
HEMBB1000738  39.002  38.955  166.616  42.588  21.380  43.330  7.181  21.192
HEMBB1000749  115.917  94.942  454.741  136.454  54.340  39.253  32.933  49.141
HEMBB1000763  47.835  25.201  36.488  16.952  21.036  31.919  14.990  12.111
HEMBB1000770  30.598  45.410  167.003  32.786  26.482  25.698  18.186  24.127
HEMBB1000774  27.168  21.690  33.470  20.937  12.916  22.598  8.092  17.606
HEMBB1000777  246.286  57.131  58.743  31.851  40.345  119.113  81.364  53.990
HEMBB1000781  41.945  36.620  34.149  24.543  23.561  16.383  14.371  20.775
HEMBB1000788  10.756  10.608  5.481  6.429  2.950  5.995  4.522  4.589
HEMBB1000789  28.490  9.620  26.151  16.088  11.640  16.477  7.916  7.672
HEMBB1000790  74.318  56.925  185.959  63.749  33.523  24.232  24.414  28.423
HEMBB1000794  18.080  17.254  38.876  24.305  7.427  10.338  5.445  9.305
HEMBB1000807  50.070  31.869  22.751  19.865  20.934  27.002  18.350  27.280
HEMBB1000809  334.541  42.976  42.300  26.454  9.545  31.526  31.677  44.152
HEMBB1000810  189.365  50.676  163.325  33.349  38.994  74.400  45.398  19.262
HEMBB1000821  40.710  9.304  21.006  6.841  5.422  15.981  10.835  5.685
HEMBB1000822  8.726  3.570  3.541  1.411  7.255  5.519  1.285  1.525
HEMBB1000826  68.485  40.348  201.149  68.467  43.204  31.769  32.812  55.387
HEMBB1000827  50.671  34.326  108.391  32.945  15.076  25.813  18.713  25.457
HEMBB1000831  38.060  20.466  29.131  12.368  19.990  20.562  25.373  6.415
HEMBB1000835  59.181  56.345  127.358  58.150  44.350  35.831  25.687  35.108
HEMBB1000840  117.639  63.375  340.802  61.186  48.924  38.995  20.712  30.526
HEMBB1000848  98.938  53.024  210.423  42.569  28.984  47.603  29.642  29.431
HEMBB1000852  1.827  2.160  0.621  2.559  1.621  1.272  1.364  1.086
HEMBB1000857  16.897  16.768  19.951  14.921  12.912  17.270  10.179  14.915


HEMBB 1000858 


25. 634 


16. 531 


O 1 CO 

0. 162 


0 ono 
8. 209 


14. 482 


12. 749 


92. 823 


10. 102 


HEMBB1000867  106.946  56.331  264.748  50.278  36.949  41.202  26.795  29.760


HEMBB 1000870 


68. bbO 


62. 423 


19Z. 351 


co Ann 
bl. 4Ub 


39. 3U3 


CC C A 1 

33. 641 


OO 71a 
Z3. (JO 


07 4 07 


HEkJB 1000876 


21.813 


12. 044 


14. 900 


It O 1 A 


7 coo 


1 n con 
1U. D3U 


11 1 A O 

1 1. (43 


OC OA\ 

CO. £41 


HEMBB 1000881 


30.089 


Id. 478 


O O O A C 

28. 345 


1 A OOC 

1 4. 92b 


10 a 1 a 
lo. 419 


1 7 7C O 

1 ( . 763 


1 0 on 1 
18. 901 


on AO A 

20. 494 


■ irunn t nAna a t 

HEMBB 1000883 


11. 669 


10. 263 


oc t oc 

CO. l03 


C 07C 

0. 9l 3 


£. f OU 


0 000 


0 one 

c. yub 


3. b4U 


ncKBoluOOoor 


A*i CIO 

41. b3o 


17 OTA 
it. CIH 


CC 7ftfl 
DO. / OU 


LL. ill j 


11 9 


aj aA? 

4£. 04£ 


9fl CO 9 

cU. Oct 


CC. 300 


LfCiiDo i Anno o o> 
HEMBB 1000B88 


on o i o 
ZO. 318 


ft 101 
O. 19J 


1 1 ahi 

1 1 . 40J 


C 17ft 


A ft71 


ft 7flfl 
0. fU» 


c nm 
0. OU I 


A 7 A 1 ) 
4. 34c 


ucuoa'i nnno on 

HEMBB 1000890 


jtn 7nr 
40. ;93 


AO 907 

4c. co/ 


117 f!7C 


9C ni 1 

CO. UJ 1 


It 171 
II. Ill 


01 11c 
£ J. 110 


1 C i*Ol 
1 3. 4S 1 


1 C AA7 
HEMBB1000905  43.501  37.125  41.367  26.379  29.649  38.699  31.891
HEMBB1000908  42.944  54.6/4  120.821  34.982  28.838  15.897  26.230
HEMBB1000910  72.960  51.795  161.850  41.050  36.594  37.378  13.612  23.263
HEMBB1000913  33.820  35.219  96.448  24.68*  12.371  26.067  14.715  19.268
HEMBB1000915  222.511  124.825  1548.228  223.176  34.989
[0284] Expression of each cDNA in human tissues (the table also contains clones without description in Examples).

Table 185

Expression of each cDNA in human pulmonary arterial endothelial cells cultured in a 
medium containing bovine serum albumin, glycated bovine serum albumin or advanced 
glycation endproduct of bovine serum albumin (This table also contains clones without 
description in Examples). 

In the table, EC_G_B/EC_BSA and EC_A_B/EC_BSA represent the ratios
EC_glycated_BSA/EC_BSA and EC_AGE_BSA/EC_BSA, respectively.
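
The two ratio columns can be recomputed from the three signal columns. The following is a minimal sketch under stated assumptions, not the procedure given in the specification: the function name `expression_ratio` and its `floor` parameter are hypothetical, and the floor of 40 is an assumption inferred from the tabulated values (rows whose signals all lie below about 40 are reported with ratios of 1), not a value stated in this passage.

```python
def expression_ratio(control, treated, floor=40.0):
    """Return treated/control, rounded to two decimals.

    Signals below `floor` are raised to `floor` before dividing.
    NOTE: the floor of 40 is an assumption inferred from the table,
    not a value given in the text.
    """
    control = max(control, floor)
    treated = max(treated, floor)
    return round(treated / control, 2)


# GAPDH control row: EC_BSA = 100.81, EC_glycated_BSA = 134.21, EC_AGE_BSA = 115.16
ec_g_b = expression_ratio(100.81, 134.21)  # EC_G_B/EC_BSA
ec_a_b = expression_ratio(100.81, 115.16)  # EC_A_B/EC_BSA
print(ec_g_b, ec_a_b)
```

With the assumed floor, a row whose three signals are all below 40 yields ratios of exactly 1, which matches how such rows appear in the table.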



Clone_name   EC_BSA   EC_glycated_BSA   EC_AGE_BSA   EC_G_B/EC_BSA   EC_A_B/EC_BSA



GAPDH(Cr1) 100.81 134.21 115.16 1.33 1.14
β-actin(Cr2) 1101.9 1092.57 997.36 0.99 0.91



ADRGL1000005 26.88 38.27 36.13 1 1 
ADRGL1000007 117.89 127.25 133.21 1.08 1.13 

ADRGL1000009 29.18 25.65 26.05 1 1 

ADRGL1000011 88.9 117.33 142.9 1.32 1.61 

ADRGL1000027 33.24 40.53 43.02 1.01 1.08 

ADRGL1000058 153.41 208.84 180.05 1.36 1.17

ADRGL1000069 16.8 21.77 29.81 1 1

ADRGL1000077 25.74 24.72 32.86 1 1 

ADRGL1000092 84.52 84.15 121.76 1 1.44 

ADRGL1000099 76.19 91.53 106.01 1.2 1.39 

ADRGL1000136 52.34 44.76 63.06 0.86 1.2 

ADRGL1000147 46.08 45.18 52.15 0.98 1.13 

ADRGL1000159 31.52 40.24 42.72 1.01 1.07 
ADRGL1000160 52.34 60.37 62.29 1.15 1.19
ADRGL1000171 21.46 16.78 25.59 1 1 
ADRGL1000181 37.44 45.71 43.65 1.14 1.09 
BGGI11000015 52.42 71 65.47 1.35 1.25
BGGI11000016 127.44 122.93 147.57 0.96 1.16
BGGI11000017 25.65 25.74 31.33 1 1
BGGI11000022 32.82 35.19 25.56 1 1
BGGI11000031 44.42 43.8 40.25 0.99 0.91
BGGI11000042 120.38 146.44 165.42 1.22 1.37
BGGI11000046 74.72 58.85 84.95 0.79 1.14
BNGH41000020 4286.08 3584.67 4330.96 0.84 1.01 
BNGH41000025 216.67 223.74 257.06 1.03 1.19
BNGH41000026 25.76 28.16 35.52 1 1 
BNGH41000027 29.23 23.83 17.86 1 1 
BNGH41000035 280.32 238.34 305.66 0.85 1.09 
BNGH41000037 59.14 54.86 54.58 0.93 0.92 

BNGH41000042 356.1 324.08 411.07 0.91 1.15 
BNGH41000048 1201.37 869.03 739.91 0.72 0.62 



BNGH41000056 33.94 31.4 40.01 1 1
BNGH41000087 77.58 81.76 91.07 1.05 1.17
BNGH41000091 21.05 21.23 26.82 1 1
BNGH41000157 81.11 57.28 77.46 0.71 0.95
BNGH41000169 21.1 17.59 22.53 1 1
BNGH41000181 63.54 56.92 70.08 0.9 1.1
BNGH41000198 32.53 26.38 34.37 1 1
HEMBB1000858 50 36.71 33.69 0.8 0.8
HEMBB1000867 68.5 85.51 71.5 1.25 1.04
HEMBB1000870 72.76 81.83 67.8 1.12 0.93
HEMBB1000876 43.35 49.98 46.2 1.15 1.07
HEMBB1000881 39.02 48.73 55.93 1.22 1.4
HEMBB1000883 27.77 27.83 32.89 1 1
HEMBB1000887 266.89 206.91 273.83 0.78 1.03
HEMBB1000888 17.01 15.62 22.89 1 1
HEMBB1000890 137.74 161.89 115.78 1.18 0.84
HEMBB1000893 74.94 56.68 80.9 0.76 1.08
HEMBB1000900 20.11 30.62 30.2 1 1
HEMBB1000905 102.51 124.91 83.08 1.22 0.81
HEMBB1000908 78.02 96.72 70.96 1.24 0.91
HEMBB1000910 49.48 58.33 49.88 1.18 1.01
HEMBB1000913 46.72 50.63 52.5 1.08 1.12
HEMBB1000915 1235.16 1265.8 1596.28 1.02 1.29
HEMBB1000917 69.87 84.21 77.72 1.21 1.11
HEMBB1000927 15.28 21.63 16.34 1 1
HEMBB1000932 45.3 54.47 57.04 1.2 1.26
HEMBB1000933 136.08 158.95 162.12 1.17 1.19
HEMBB1000936 64.3
HEMBB1000939 68.9
HEMBB1000941 33.76
HEMBB1000947 35.6
HEMBB1000954 28.25
HEMBB1000959 63.24
HEMBB1000973 25.02
1.22 27.91 25.38
HEMBB1000975 16.17 19.83 18.15 1 1
HEMBB1000981 20.67 24.33 27.87 1 1
HEMBB1000985 42.97 44 49.41 1.02 1.15
HEMBB1000991 20.6 19.71 27.25 1 1
HEMBB1000996 100.51 115.97 153.69 1.15 1.53
HEMBB1001000 33.13 32.47 31.91 1 1
HEMBB1001004 14.02 13.99 23.74 1 1
HEMBB1001008 29.03 29.07 27.37 1 1
HEMBB1001011 28.26 27.11 28.64 1 1
HEMBB1001014 35.72 32.4 51.5 1 1.29
HEMBB1001020 53.84 59.19 61.08 1.1 1.13
HEMBB1001024 103.64 109.34 125.21 1.05 1.21
HEMBB1001026 119.18 76.31 81.91 0.64 0.69
HEMBB1001037 57.7 74.41 69.67 1.29 1.21
HEMBB1001042 14.13 15.47 21.5 1 1
HEMBB1001046 14.85 32.01 16.89 1 1
HEMBB1001047 39.94 47.76 56.41 1.19 1.41
HEMBB1001048 65.35 74.67 76.93 1.14 1.18
HEMBB1001051 28.42 37.58 37.86 1 1
HEMBB1001056 34.89 38.36 38.94 1 1
HEMBB1001058 39.94 48.58 50.58 1.21 1.26
HEMBB1001060 22.94 36.58 33.65 1 1
HEMBB1001063 27.68 28.29 27.17 1 1
HEMBB1001068 29.13 24.56 27.82 1 1
HEMBB1001082 54.37 76.72 82.56 1.41
Y79AA1002373 43.96 55.06 28.34 1.25 0.91
Y79AA1002376 3080.78 3824.05 4481.1 1.24 1.45
Y79AA1002378 73.33 93.61 68.22 1.28 0.93
Y79AA1002381 248.36 288.51 304.13 1.16 1.22
Y79AA1002388 118.82 135.82 129.37 1.14 1.09
Y79AA1002399 36.12 30.1 32.87 1 1
Y79AA1002407 57.84 42.82 52.54 0.74 0.91
Y79AA1002413 78.77 81.36 87.31 1.03 1.11
Y79AA1002416 34.3 30.2 51.99 1 1.3
Y79AA1002429 67.91 69.81 80.19 1.03 1.18
Y79AA1002431 24.66 21.16 23.98 1 1
Y79AA1002433 27.12 18.11 23.63 1 1
Y79AA1002445 78.66 54.58 73.75 0.69 0.94
Y79AA1002461 29.04 24.84 32 1 1
Y79AA1002466 882.69 904.65 782.53 1.02 0.89
Y79AA1002471 53.74 51.26 68.91 0.95 1.28
Y79AA1002472 121.95 127.4 127.11 1.04 1.04
Y79AA1002474 53.33 40.85 47.18 0.77 0.88
Y79AA1002482 103.36 111.11 116.07 1.07 1.12
Y79AA1002487 30.92 25.8 32.51 1 1
Y79AA1002490 101.4 90.92 90.54 0.9 0.89
Y79AA1002493 107.88 125.54 105.75 1.16 0.98
ZRV6C1006278 46.63 30.08 32.23 0.86 0.86



Table 186 

Expression of each cDNA in undifferentiated NT2 cells, in NT2 cells cultured in the presence
of retinoic acid, or in NT2 cells cultured in the presence of retinoic acid and then
further cultured with a cell-division inhibitor added (This table also contains clones
without description in Examples).

In the table, NT2, NT2_RA, and NT2_RA_INHIB represent untreated NT2 cells, retinoic
acid-treated NT2 cells, and retinoic acid/inhibitor-treated NT2 cells, respectively. The assay
was performed in triplicate (n=3), and each result is shown in the column of exp.1, exp.2, or
exp.3. In addition, "t-test N/R" and "t-test N/I" represent the results of tests for significance of
difference between the untreated cells and the retinoic acid-treated cells, and between the
untreated cells and the retinoic acid/inhibitor-treated cells, respectively. The results of the tests
are indicated as *: p<0.05 and **: p<0.01.
4*
5.01 4.83 5.49

ADRGL1000147 23.09 13.85 11.7 14.77 14.96 14.89 17.7 13.3
Table 221

Table 366

Expression of each cDNA in synovial cells or in synovial cells cultured in the presence of TNF
(This table also contains clones without description in Examples).

In the table, Synoviocyte and Synoviocyte_TNF represent synovial cells and TNF-treated
synovial cells, respectively. The assay was performed in triplicate (n=3), and each result is
shown in the column of exp.1, exp.2, or exp.3. In addition, "t-test vs TNF" represents the result of a
test for significance of difference between the untreated synovial cells and the TNF-treated
synovial cells. An increase or decrease in the expression level of a particular gene in response
to TNF is represented by + or -, respectively. The results of the significance test
are indicated as *: p<0.05 and **: p<0.01.
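
The significance marks and the direction column can be illustrated with a short calculation. This is a sketch under stated assumptions rather than the procedure the specification used: the passage does not say which t-test was applied, so an unpaired, two-sided, equal-variance Student's t-test is assumed here, and the function names `t_statistic` and `annotate` are hypothetical. The critical values are the standard two-sided table values for 4 degrees of freedom (three replicates per group).

```python
from math import sqrt
from statistics import mean, variance

# Two-sided critical values of Student's t for 4 degrees of freedom
# (n1 + n2 - 2 with n = 3 per group), taken from standard t tables.
T_CRIT_05 = 2.776
T_CRIT_01 = 4.604


def t_statistic(a, b):
    """Pooled-variance (equal-variance) two-sample t statistic for b versus a."""
    na, nb = len(a), len(b)
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(b) - mean(a)) / sqrt(pooled * (1 / na + 1 / nb))


def annotate(untreated, treated):
    """Return (significance mark, direction) in the table's notation.

    ASSUMPTION: unpaired two-sided Student's t-test; the text does not
    specify the test variant.
    """
    assert len(untreated) == len(treated) == 3  # critical values assume df = 4
    t = t_statistic(untreated, treated)
    if abs(t) >= T_CRIT_01:
        mark = "**"
    elif abs(t) >= T_CRIT_05:
        mark = "*"
    else:
        return "", ""
    return mark, "+" if t > 0 else "-"


# Replicate values taken from the ADRGL1000007 and ADRGL1000005 rows of the table
print(annotate([4.36, 5.19, 9.58], [20.78, 19.59, 18.29]))
print(annotate([2.72, 2.97, 4.46], [7.27, 7.45, 3.51]))
```

The direction is reported only when the difference is significant, mirroring the table, where + appears only alongside * or **.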



Clone   Synoviocyte exp.1   Synoviocyte exp.2   Synoviocyte exp.3   Synoviocyte_TNF exp.1   Synoviocyte_TNF exp.2   Synoviocyte_TNF exp.3   t-test vs TNF   Inc. and Dec.



GAPDH(Cr1) 0.4 0.8 0.89 0.9 1 1.15
β-actin(Cr2) 385.94 262.23 582.98 443.28 422.61 573.47
ADRGL1000005 2.72 2.97 4.46 7.27 7.45 3.51 
ADRGL1000007 4.36 5.19 9.58 20.78 19.59 18.29 ** +
ADRGL1000009 0.99 1.25 1.64 2.16 4.08 2.02 
ADRGL1000011 1.98 3.56 5.24 22.22 23.49 19.81 ** + 
ADRGL1000027 0.79 1.22 1.66 2.82 4.99 1.9
ADRGL1000058 4.12 7.08 26.9 62.55 67.32 49.15 ** + 
ADRGL1000069 1.91 1.68 2.47 14.19 14.54 13.74 ** + 
ADRGL1000077 1.98 2 2.54 5.5 2.9 4.16 
ADRGL1000092 2.99 4.79 12.53 21.46 22.09 26.19 ** + 
ADRGL1000099 2.77 4.79 12.85 23.61 24.02 25.56 ** + 
HEMBB1000714 1.41 2.85 9.32 5.31 10.37 8.79
HEMBB1000725 1.61 2.22 4.35 3.04 6.22 4.72
HEMBB1000726 1.88 2.34 8.76 5.63 7.1 4.83
HEMBB1000729 1.82 3.28 4.3 3.3 5.21 2.79
HEMBB1000738 1.94 2.6 5.55 3.99 5.53 6.15
HEMBB1000749 4.06 4.15 7.47 7.48 9.56 8.27
HEMBB1000763 5.81 5.56 6.21 6.65 9.9 6.61
HEMBB1000770 2.76 2.06 8.8 7.73 9.62 8.83
HEMBB1000774 1.62 2.75 3.7 3.07 4.39 2.34
HEMBB1000777 5.17 5.49 7.42 6.86 4.9 7.37
HEMBB1000781 3.7 4.19 6.89 7.64 5.28 6.83
HEMBB1000788 0.87 1.79 2.45 2.65 4.88 1.35
HEMBB1000789 1.91 2.22 3.01 3.1 6.55 1.86
HEMBB1000790 1.97 2.15 4.48 4.59 4.21 2.53
HEMBB1000794 1.46 1.8 2.85 2.97 3.84 2.06
HEMBB1000807 2.55 2.72 5.18 3.57 4 4.26
HEMBB1000809 30.31 26.87 132.99 158.22 156.74 195.14
HEMBB1000810 1.98 2.67 4.51 3.76 6.08 4.12
HEMBB1000821 1.98 1.93 2.98 2.05 4.45 1.79
HEMBB1000822 1.08 1.97 2.31 1.65 5.31 1.46
HEMBB1000826 1.36 1.99 3.57 3.77 6.11 3.6
HEMBB1000827 2.48 2.89 5.83 2.67 5.05 2.99
HEMBB1000831 3.4 2.31 5.67 3.84 7.74 2.95
HEMBB1000835 1.76 1.94 6.2 7.59 7.62 7.47 * +
HEMBB1000840 1.27 2.95 6.89 4.48 7.19 3.01
HEMBB1000848 2.08 3.45 5.63 5.39 6.45 5.3
HEMBB1000852 1.26 2.16 2.8 1.07 4.51 1.55
HEMBB1000857 7.65 6.49 8.13 7.01 10.69 11.53
HEMBB1000858 3.7 3.13 7.3 7.07 9.38 7.31
HEMBB1000867 2.21 1.84 4.9 3.02 5.55 4.04
HEMBB1000870 1.64 2.37 4.56 2.84 5.31 3.63
HEMBB1000876 1.48 2.86 3.91 4.54 3.22 3.93
HEMBB1000881 3.35 5.56 10.5 6.12 5.88 3.85
HEMBB1000883 1.02 2.68 2.2 3.03 3.32 2.58
HEMBB1000887 16.9 14.54 43.41 67.39 61.26 59.84 *
HEMBB1000888 1.03 1.67 2.39 1.63 3.92 1.86
HEMBB1000890 2.93 3.36 10.85 6.01 8.62 7.68
HEMBB1000893 3.28 2.54 5.46 4.5 6.14 5.57
HEMBB1000900 1.27 1.53 2.98 2.06 2.54 1.58
HEMBB1000905 5.09 3.75 6.6 10.05 9.45 8.77 **
HEMBB1000908 3.34 2.79 3.01 4.48 4.71 5.7 **
HEMBB1000910 1.74 2.91 2.55 2.09 3.56 2.24
HEMBB1000913 1.41 1.51 2.22 2.8 3.41 1.91
HEMBB1000915 32.08 25.6 50.05 48 58.92 51.07
HEMBB1000917 2.1 2.78 5.72 2.99 4.52 3.44
HEMBB1000927 1.45 1.24 1.82 1.49 3.25 1.88
HEMBB1000932 0.66 2.06 2.74 1.81 3.41 1.61
HEMBB1000933 7.47 7.12 10.71 12.88 12.78 19.19
HEMBB1000936 1.44 1.96 2.87 3.75 6.44 3.55
HEMBB1000939 7.86 7.14 9.02 15.98 15.3 18.25 **
HEMBB1000941 1.53 1.86 3.17 3.99 4.46 3.52 *
HEMBB1000947 3.53 3.34 4.61 4.67 6.8 5.72
Table 367

Difference in the expression level of each clone in response to TNF stimulation or IL-1
stimulation

"Before stimulation", "IL1 1h", and "IL1 7h" represent relative levels of expression in the absence
of stimulation, 1 hour after the IL-1 stimulation, and 7 hours after the IL-1 stimulation,
respectively. "TNF 1h", "TNF 3h", and "TNF 7h" represent relative levels of expression 1 hour,
3 hours, and 7 hours after the TNF stimulation, respectively.
Correlation coefficients 1 and 2 indicate the correlation coefficients of the calibration curves
prepared from the data for the internal standard in reaction systems A and B, respectively.
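
As an illustration of what a calibration-curve correlation coefficient measures, the sketch below computes a Pearson correlation coefficient between known amounts of an internal standard and the measured signals. This is an assumption-laden example: the passage does not state which coefficient was used, the function name `pearson_r` is hypothetical, and the data points are invented for illustration and do not come from the specification.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of paired observations."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical calibration points (NOT from the specification):
# relative amount of internal standard versus measured signal.
amounts = [1.0, 3.0, 10.0, 30.0]
signals = [2.1, 5.9, 20.5, 59.0]
r = pearson_r(amounts, signals)
print(round(r, 2))
```

A value close to 1, as in most rows of the table, indicates that the calibration curve for that reaction system is nearly linear.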



Clone   Before stimulation   IL1 1h   IL1 7h   TNF 1h   TNF 3h   TNF 7h   Correlation coefficient 1   Correlation coefficient 2



NT2RM1000858 5.6 7.6 3.8 4.7 2.1 1.7 0.98 0.94 

NT2RM1000462 0.9 0.9 0.5 0.7 0.1 0 1 1 
NT2RM1000855 1 1.3 1 1.1 0.4 0.4 1 1 
NT2RM1000789 1 0.9 0.4 1 0.4 0.6 0.96 0.98 

NT2RM2000306 0.7 1.1 0.3 1.1 0.3 0.1 1 0.98 

NT2RM2000514 0.2 0.2 0.6 0.2 0.1 0.2 0.98 0.96 

NT2RM2001126 0.5 0 0.4 0.3 0.3 1.2 0.99 0.99 

NT2RM2001902 1.3 1.6 0.6 1.3 0.8 0.8 1 1 

NT2RM2001738 1.6 1.8 1.5 1.7 0.8 0.9 0.98 1 

NT2RM2000582 0.2 0.1 0 0.7 0.1 0.1 0.99 0.99 

NT2RM2000773 1.1 1.2 1.4 2 1 0.8 0.95 1

NT2RM2001626 0.4 0.2 0.6 0.7 0.1 0.7 1 1 

NT2RM2001643 1.6 3.1 1.2 2.4 0.7 0.8 1 1

NT2RM2001792 0.2 0 0 0.3 0.1 0.1 0.98 0.97 

NT2RM2000589 0.2 0.1 0 0.1 0 0 1 0.99 

NT2RM2000588 0.6 0.7 0.1 0.8 0.2 0.2 1 1 
NT2RM2002109 0 0 0 0.2 0.1 0 0.99 0.99 

NT2RM4000284 6.5 9.1 4.8 10.1 3.4 3 1 1 

NT2RM4001735 3.8 4.6 2.1 5 1.6 1.4 1 1

NT2RM4000100 0.5 0.6 0.2 0.5 0.3 0.3 0.95 0.95 

NT2RM4000417 0.2 0 0 0.2 0.1 0 0.99 0.98 

NT2RM4000761 3.2 3.2 2.2 2.6 0.7 0.7 0.95 1 

NT2RM4001843 1.5 1.8 1.7 2.8 1.2 0.6 0.98 1 

NT2RP1000239 2.1 3.2 1.2 2.1 0.5 0.6 1 0.99 

NT2RP1000465 0.9 0.3 0.3 0.9 0.2 0.1 0.97 0.96 

NT2RP1000679 0.3 0.3 0.4 0.9 0.2 0.3 0.97 1 

NT2RP1001031 1.4 1.4 0.4 1.2 0.1 0.3 1 0.98 

NT2RP2001200 2 1.5 0.8 2.2 0.7 0.7 0.99 1 

NT2RP2001562 2.7 2.4 0.7 3.6 0.4 1.1 1 0.94 

NT2RP2001948 1.1 1.5 0.7 1.3 0.6 0.7 0.97 0.99 

NT2RP2002015 1.3 1.7 0.7 1.8 0.6 0.5 0.99 1 

NT2RP2003390 2 1.7 1.3 2.3 0.6 0.5 0.99 0.99 

NT2RP2003664 0.4 0.1 0.1 0.8 0.1 0 0.99 0.99 

NT2RP2005597 1.2 1.4 0.5 2.7 2.2 2.2 0.96 0.99 

NT2RP2001469 1.7 1.4 1.2 2 0.6 0.6 1 1 

NT2RP2000240 0.9 0.9 0.3 1.4 0.7 0.3 1 1 

NT2RP2000610 2.4 2.2 2.1 2.7 1.5 1.6 0.93 0.96 
NT2RP3000232 0.6 0.8 0.4 0.3 0 0 1 0.99
NT2RP3001650 2.3 1.5 1.6 1.7 1 1.3 1 1 
NT2RP3002411 0.5 0.4 0.1 0.5 0.2 0.1 1 1 
NT2RP4001001 0.8 1.3 0.7 1 0.7 0.4 0.97 0.97 
NT2RP4001877 1.5 0.9 1.1 1.2 0.5 0.7 1 0.99 
NT2RP4002451 0.7 1 0.6 0.7 0.2 0.3 0.91 0.95 
NT2RP4000634 1 1 0.3 0.9 0.3 0.4 0.99 1 
NT2RP4002187 0.4 0.4 0.1 0.7 0.3 0.2 1 0.99
NT2RP4002715 1.5 1.6 0.7 1.5 0.4 0.3 1 0.99 
MAMMA1000986 3.9 4.1 1.9 4.2 1.8 1.4 0.99 1 
MAMMA1001237 0 0 1.6 0.2 0 0 0.99 0.98 
MAMMA1001978 3.5 3.4 2.3 6 3.4 2.5 0.97 0.98 
MAMMA1002080 0.4 0 0 0.4 0.1 0 1 0.99 
MAMMA1002234 4 4.4 3 7.7 1.9 3 0.97 1 
MAMMA1000614 4.8 1 15.5 5.6 3.9 4.8 0.95 0.93 
MAMMA1000141 7.1 11.5 3.5 14.8 6.5 3.7 1 0.98 
MAMMA1000706 7.2 9.3 3.9 3.7 2.3 2.6 0.98 0.99 
MAMMA1000788 3 3.8 2.8 8.9 4.8 4.2 0.92 0.98 
MAMMA1000994 0.3 0 0 0.4 0 0 1 1 
MAMMA1001310 4.1 6.1 3.8 8 2.5 3.6 0.99 0.95 
MAMMA1001344 2.7 4.4 2.2 3.2 2.6 2.1 1 0.99 
MAMMA1001957 2.3 2.7 1.9 1.7 1 1.8 0.99 1 
MAMMA1002070 0.1 0.1 0 0.8 0.4 0.2 1 0.99
MAMMA1002586 1.7 1.6 1.2 1.3 0.4 0.3 0.94 1 
MAMMA1000102 2.1 2.3 1.4 3.3 1.6 1.6 1 1 
MAMMA1001066 2.8 2.6 1.8 5.3 0.7 1.2 1 0.98 
MAMMA1001094 2.3 2.9 2 3.3 2.1 2.5 0.96 0.9
MAMMA1001609 2 3 1.2 2.7 1.7 2.2 0.99 0.97 
PLACE1002547 2 1.7 1.2 4.1 1.2 2 0.95 1 
PLACE1003573 0 0 0 0.1 0 0 1 0.98
PLACE1004199 0.1 0.2 0 0 0 0 0.99 0.97 
PLACE1004305 0 0 0 0.3 0 0.2 0.96 0.99 
PLACE1004450 0.9 0.3 0 0.1 0 0 0.98 0.98 
PLACE1005031 0.9 0 0 0.5 0 0 0.98 0.99 
PLACE1007845 0.8 1 0.4 0.4 0.1 0.1 1 0.98
PLACE1008984 1.4 1.2 0.4 1.9 0.6 0.5 0.98 0.98 
PLACE1011116 2.6 1.5 1.6 1.6 0.3 0.4 1 1 
PLACE1000986 0.6 0.2 0.2 0.3 0.1 0.1 1 0.98
PLACE1004492 1.9 1.9 1.5 3.3 1 1 1 0.97 
PLACE1005569 2.6 0.4 0 1.1 0.3 0.1 0.98 0.99 
PLACE1005601 1.7 1.3 1 2.3 0.6 0.3 0.93 1 
PLACE1006079 0.6 0.3 0 0.1 0.1 0 0.98 0.99
PLACE1007077 1.1 0 0 0.3 0.1 0 0.97 0.98
PLACE1008744 0.4 0.1 0.1 1.1 0.1 0 0.98 1 
PLACE1011181 0.6 0.3 0.5 1.6 0.3 0.5 0.98 0.99 
PLACE1005539 0.4 0 0.2 0.3 0.2 0 1 0.93 
PLACE1008282 1.1 0.7 0.6 1.2 0.4 0.4 0.98 1
PLACE1010713 0.6 0.7 0 1.4 0.5 0.4 0.99 0.95
PLACE1010011 1.2 1.4 0.2 2.7 1.5 1.7 1 0.99 
PLACE3000213 1.9 0.2 0.1 0.8 0.1 0 0.99 1 
PLACE1002080 6.7 3.9 0.3 17 0.8 0.5 0.95 0.98 
SKNMC1000082 1.3 0.1 1.1 0.7 0 0 1 1 
Y79AA1000127 1.8 1.8 1.1 2.1 0.5 0.6 1 1
Y79AA1000226 1.4 0.8 0.6 0.9 0.3 0.4 0.99 0.99
Y79AA1000776 0.3 0.1 0 1.1 0.3 0.5 0.99 0.99
Y79AA1000876 1.1 1.5 1.2 1.3 0.5 0.8 0.97 1
Y79AA1001056 1.7 1.7 0.8 1.4 0.9 0.7 1 1
Y79AA1000777 3.1 3.1 1.2 3.8 0.7 0.5 0.98 0.99
Y79AA1000030 1 1.3 0.2 1.3 0 0.6 0.98 0.96
Y79AA1001212 1.5 1.2 1 2 0.8 0.5 1 0.99
Y79AA1001427 2.3 3 0.6 2 0.8 0.4 1 1
Y79AA1001530 0.9 0.9 0.5 1.1 0.4 0.4 1 1
Y79AA1001592 0.6 0.2 0 0.7 0 0 0.97 1
Y79AA1001727 0.8 0.4 0.2 0.9 0.2 0.1 1 1
Y79AA1001803 0.1 0 0 0.2 0.1 0 0.97 0.99
Y79AA1002373 0 0 0 0 0 0 0.99 1
Y79AA1002376 0.9 0.1 0 1.2 0.1 0.4 0.98 1
Y79AA1001523 0.5 0.5 0.3 0.6 0.3 0.1 1 0.98
Y79AA1000888 1.1 1 0.7 1.4 0.7 0.5 1 1
Y79AA1002129 0.2 0.2 0.1 0.5 0.2 0.2 0.99 1




[0285] The present invention provides a total of 830 novel full-length cDNA clones. Because the isolation of
full-length human cDNAs has not yet progressed far, the invention is of considerable significance. Secretory
proteins, membrane proteins, and proteins associated with signal transduction, glycoproteins, and transcription
are known to be associated with many diseases. Genes and proteins associated with diseases are useful
for developing medicines, because they can serve as diagnostic markers, as targets for gene therapy, or as targets
for medicines capable of regulating their expression and activity. In particular, cDNA clones encoding secretory
proteins are extremely important to the pharmaceutical industry, since the protein itself is expected to be effective
as a medicine and the gene may be associated with many diseases. Moreover, membrane proteins, proteins
associated with signal transduction, glycoproteins, transcription, or diseases, and the
genes encoding these proteins, may be used as disease markers. These cDNA clones are also important to the
pharmaceutical industry because they may be effective for treating diseases through regulation of the expression
and activity of the proteins they encode.

Table 368 

The names of the representative sequences of the clusters (groups) and the corresponding SEQ 
IDs. 



HRIFA000016a : 1573 
HRIFA000071a: 1574 
HRIFA000116a: 1575 
HRIFA000123a : 1576 
HRIFA000264a : 1577 
HRIFA000284a : 1578 
HRIFA000327a : 1579 
HRIFA000415a : 1580 
HRIFA000432a : 1581 
HRIFA000446a : 1582 
HRIFA000553a : 1583 
HRIFA000564a : 1584 
HRIFA000631a: 1585 



HRIFA017855a : 1979 
HRIFA017921a : 1980 
HRIFA018075a : 1981
HRIFA018092a : 1982
HRIFA018131a : 1983
HRIFA018134a : 1984
HRIFA018238a : 1985
HRIFA018262a : 1986
HRIFA018287a : 1987
HRIFA018447a : 1988
HRIFA018580a : 1989
HRIFA018666a : 1990
HRIFA018688a : 1991
HRIFA012333a : 1846
HRIFA012354a : 1847
HRIFA012417a : 1848
HRIFA012427a : 1849
HRIFA012436a : 1850
HRIFA012480a: 1851 
HRIFA012513a : 1852 
HRIFA012515a : 1853 
HRIFA012584a : 1854 
HRIFA012625a : 1855 
HRIFA012692a : 1856 
HRIFA012702a : 1857 
HRIFA012737a : 1858 
HRIFA012761a : 1859 
HRIFA012795a : 1860
HRIFA012881a: 1861 
HRIFA012885a : 1862 
HRIFA012914a : 1863 
HRIFA012969a : 1864 
HRIFA012990a: 1865 
HRIFA012999a : 1866 
HRIFA013092a : 1867 
HRIFA013103a: 1868 
HRIFA013135a : 1869 
HRIFA013235a : 1870 
HRIFA013254a: 1871 
HRIFA013265a : 1872 
HRIFA013276a : 1873 
HRIFA013279a : 1874 
HRIFA013288a : 1875 
HRIFA013376a : 1876 
HRIFA013477a : 1877 
HRIFA013586a : 1878 
HRIFA013589a : 1879
HRIFA013620a : 1880
HRIFA013668a : 1881
HRIFA013726a : 1882
HRIFA013744a : 1883
HRIFA013899a: 1884 
HRIFA013911a: 1885 
HRIFA013919a : 1886 
HRIFA013932a : 1887
HRIFA013980a : 1888
HRIFA014006a : 1889
HRIFA014024a : 1890
HRIFA014056a : 1891
HRIFA014111a : 1892
HRIFA014133a: 1893 
HRIFA014178a : 1894 
HRIFA014185a : 1895 
HRIFA014303a : 1896 
HRIFA014336a : 1897 



HRIFA028501a:2252 
HRIFA028511a:2253 
HRIFA028573a : 2254 
HRIFA028576a : 2255 
HRIFA028592a : 2256 
HRIFA028614a : 2257 
HRIFA028651a : 2258 
HRIFA028652a : 2259 
HRIFA028654a : 2260 
HRIFA028708a : 2261 
HRIFA028790a : 2262 
HRIFA028804a : 2263 
HRIFA028867a : 2264 
HRIFA028911a:2265 
HRIFA028926a : 2266
HRIFA028983a : 2267
HRIFA029002a : 2268
HRIFA029050a : 2269
HRIFA029107a : 2270 
HRIFA029208a : 2271 
HRIFA029209a : 2272 
HRIFA029256a : 2273 
HRIFA029263a : 2274 
HRIFA029274a : 2275 
HRIFA029278a : 2276 
HRIFA029285a : 2277 
HRIFA029317a:2278 
HRIFA029327a : 2279 
HRIFA029349a : 2280 
HRIFA029393a : 2281 
HRIFA029398a : 2282 
HRIFA029425a : 2283 
HRIFA029434a : 2284
HRIFA029440a : 2285 
HRIFA029460a : 2286 
HRIFA029467a : 2287 
HRIFA029508a : 2288 
HRIFA029511a:2289 
HRIFA029577a : 2290 
HRIFA029602a : 2291 
HRIFA029649a : 2292 
HRIFA029715a : 2293 
HRIFA029730a : 2294 
HRIFA029779a : 2295 
HRIFA029792a : 2296 
HRIFA029802a : 2297 
HRIFA029866a : 2298 
HRIFA029932a : 2299 
HRIFA030025a : 2300 
HRIFA030045a : 2301 
HRIFA030097a : 2302
HRIFA030103a : 2303 
HRIFA014396a : 1898
HRIFA014397a : 1899 
HRIFA014417a : 1900 
HRIFA014465a : 1901 
HRIFA014467a : 1902 
HRIFA014482a : 1903 
HRIFA014500a : 1904 
HRIFA014561a : 1905
HRIFA014568a : 1906
HRIFA014590a : 1907 
HRIFA014598a : 1908 
HRIFA014620a : 1909 
HRIFA014621a: 1910 
HRIFA014688a : 1911 
HRIFA014692a : 1912 
HRIFA014702a : 1913
HRIFA014819a : 1914
HRIFA014868a : 1915
HRIFA014951a : 1916
HRIFA014953a : 1917 
HRIFA014967a : 1918 
HRIFA015063a : 1919 
HRIFA015070a : 1920
HRIFA015122a : 1921
HRIFA015129a : 1922
HRIFA015219a : 1923 
HRIFA015236a : 1924 
HRIFA015246a : 1925 
HRIFA015351a: 1926 
HRIFA015409a : 1927 
HRIFA015423a : 1928
HRIFA015443a : 1929 
HRIFA015453a : 1930 
HRIFA015471a: 1931 
HRIFA015486a : 1932 
HRIFA015506a : 1933 
HRIFA015536a : 1934 
HRIFA015547a : 1935 
HRIFA015568a : 1936
HRIFA015671a: 1937 
HRIFA015682a : 1938 
HRIFA015756a : 1939 
HRIFA015764a : 1940 
HRIFA015802a : 1941 
HRIFA015811a: 1942 
HRIFA015902a : 1943
HRIFA015947a : 1944
HRIFA015995a : 1945
HRIFA016070a : 1946
HRIFA016129a : 1947 
HRIFA016214a : 1948 
HRIFA016240a : 1949 



HRIFA030106a : 2304 
HRIFA030147a : 2305 
HRIFA030203a : 2306 
HRIFA030237a : 2307 
HRIFA030248a : 2308 
HRIFA030250a : 2309 
HRIFA030264a : 2310 
HRIFA030342a: 2311 
HRIFA030370a : 2312 
HRIFA030371a:2313 
HRIFA030381a:2314 
HRIFA030385a : 2315 
HRIFA030394a : 2316 
HRIFA030408a : 2317 
HRIFA030411a: 2318 
HRIFA030448a : 2319 
HRIFA030456a : 2320
HRIFA030461a : 2321
HRIFA030472a : 2322 
HRIFA030509a : 2323 
HRIFA030511a:2324 
HRIFA030545a : 2325 
HRIFA030566a : 2326 
HRIFA030599a : 2327 
HRIFA030629a : 2328 
HRIFA030642a : 2329 
HRIFA030662a : 2330
HRIFA030839a : 2331
HRIFA030981a : 2332
HRIFA031062a : 2333
HRIFA031075a : 2334
HRIFA031091a : 2335
HRIFA031126a : 2336
HRIFA031249a : 2337
HRIFA031336a : 2338
HRIFA031350a : 2339
HRIFA031395a : 2340
HRIFA031397a : 2341
HRIFA031438a:2342 
HRIFA031472a:2343 
HRIFA031510a:2344 
HRIFA031672a:2345 
HRIFA031869a : 2346 
HRIFA031871a:2347 
HRIFA031895a:2348 
HRIFA031935a:2349 
HRIFA031986a : 2350 
HRIFA032009a:2351 
HRIFA032011a:2352 
HRIFA032066a : 2353 
HRIFA032067a : 2354 
HRIFA032070a : 2355 
HRIFA016255a : 1950
HRIFA016290a : 1951
HRIFA016430a : 1952
HRIFA016599a : 1953
HRIFA016623a : 1954
HRIFA016639a : 1955
HRIFA016654a : 1956
HRIFA016669a : 1957
HRIFA016758a : 1958
HRIFA016838a : 1959
HRIFA016963a : 1960
HRIFA017031a : 1961 
HRIFA017146a: 1962 
HRIFA017190a: 1963 
HRIFA017257a : 1964 
HRIFA017295a : 1965 
HRIFA017312a : 1966 
HRIFA017456a : 1967 
HRIFA017457a : 1968 
HRIFA017509a : 1969 
HRIFA017594a : 1970 
HRIFA017643a : 1971 
HRIFA017670a : 1972 
HRIFA017703a : 1973 
HRIFA017729a : 1974 
HRIFA017791a : 1975 
HRIFA017801a : 1976 
HRIFA017818a : 1977 
HRIFA017836a : 1978 



HRIFA032073a : 2356 
HRIFA032079a : 2357 
HRIFA032097a : 2358 
HRIFA032161a : 2359 
HRIFA032186a : 2360 
HRIFA032224a : 2361
HRIFA032257a : 2362 
HRIFA032271a : 2363 
HRIFA032274a : 2364 
HRIFA032275a : 2365
HRIFA032360a : 2366 
HRIFA032389a : 2367 
HRIFA032433a : 2368 
HRIFA032453a : 2369 
HRIFA032478a : 2370 
HRIFA032506a : 2371 
HRIFA032511a:2372 
HRIFA032530a : 2373 
HRIFA032587a : 2374 
HRIFA032605a : 2375
HRIFA032642a : 2376
HRIFA032696a : 2377
HRIFA032730a : 2378 
HRIFA032820a : 2379 
HRIFA032984a : 2380 
HRIFA033349a : 2381 
HRIFA033718a : 2382 
HRIFA034010a : 2383 

Table 369 

The names of the internal sequences that are used in the selection of the clones from the
representative sequences, and the corresponding SEQ IDs.




AA533598 : 2384 HRIFA036799a : 2463 
AI051329 : 2385 HRIFA037138a : 2464
HRIFA000595a : 2386 HRIFA037776a : 2465 



HRIFA000667a : 2387 
HRIFA000878a : 2388 
HRIFA001269a : 2389
HRIFA001283a : 2390 
HRIFA002000a : 2391 
HRIFA002196a : 2392 
HRIFA003583a : 2393 
HRIFA005077a : 2394
HRIFA005781a:2395 
HRIFA006216a:2396 
HRIFA006468a : 2397 
HRIFA006822a : 2398 
HRIFA007048a : 2399
HRIFA007661a : 2400 



HRIFA037838a : 2466
HRIRA000001a : 2467
HRIRA000041a : 2468
HRIRA000058a : 2469
HRIRA000260a : 2470
HRIRA000490a : 2471
HRIRA000522a : 2472 
HRIRA000553a : 2473 
HRIRA000563a : 2474 
HRIRA000640a : 2475 
HRIRA000725a : 2476 
HRIRA000998a : 2477 
HRIRA001053a : 2478
HRIRA001314a:2479 



HRIFA007777a : 2401
HRIFA007997a : 2402
HRIFA008312a : 2403
HRIFA009250a : 2404 
HRIFA009495a : 2405 
HRIFA009607a : 2406 
HRIFA009923a : 2407
HRIFA009978a : 2408 
HRIFA010730a : 2409 
HRIFA011029a : 2410
HRIFA011416a:2411 
HRIFA011461a : 2412 
HRIFA012670a : 2413
HRIFA012717a : 2414 
HRIFA012802a:2415 
HRIFA013357a : 2416 
HRIFA013484a : 2417
HRIFA015333a : 2418 
HRIFA015375a : 2419 
HRIFA015663a : 2420 
HRIFA016287a:2421 
HRIFA016302a : 2422 
HRIFA016782a : 2423 
HRIFA018555a : 2424 
HRIFA019338a : 2425
HRIFA020315a : 2426 
HRIFA020806a : 2427 
HRIFA022264a : 2428 
HRIFA022923a : 2429
HRIFA023027a : 2430 
HRIFA023218a:2431 
HRIFA023363a : 2432 
HRIFA023434a : 2433 
HRIFA023444a : 2434 
HRIFA023551a:2435 
HRIFA023558a : 2436 
HRIFA023641a : 2437
HRIFA023798a : 2438 
HRIFA024330a : 2439 
HRIFA024338a : 2440 
HRIFA024384a : 2441
HRIFA024644a : 2442 
HRIFA025170a:2443 
HRIFA025496a : 2444
HRIFA025565a : 2445 
HRIFA025651a : 2446 
HRIFA026224a : 2447 
HRIFA026729a : 2448 
HRIFA026925a : 2449 
HRIFA028501a : 2450 
HRIFA029454a : 2451
HRIFA030181a : 2452 



HRIRA001443a : 2480
HRIRA001473a:2481 
HRIRA001648a:2482 
HRIRA001690a : 2483 
HRIRA001726a : 2484 
HRIRA001884a : 2485
HRIRA002098a : 2486 
HRIRA002100a : 2487 
HRIRA002155a : 2488 
HRIRA002307a : 2489 
HRIRA002442a : 2490 
HRIRA002446a : 2491 
HRIRA002479a : 2492 
HRIRA002945a : 2493 
HRIRA003028a : 2494
HRIRA003108a : 2495
HRIRA003139a:2496 
HRIRA003819a : 2497 
HRIRA004049a : 2498 
HRIRA004286a : 2499 
HRIRA004583a : 2500 
HRIRA004691a : 2501 
HRIRA004783a : 2502 
HRIRA005152a : 2503 
HRIRA005221a : 2504 
HRIRA005227a : 2505 
HRIRA005305a : 2506 
HRIRA005563a : 2507 
HRIRA006263a : 2508 
HRIRA006324a : 2509 
HRIRA006517a : 2510 
HRIRA006580a:2511 
HRIRA007665a : 2512 
HRIRA007680a:2513 
HRIRA008129a : 2514 
HRIRA008152a : 2515 
HRIRA008276a : 2516 
HRIRA008329a : 2517 
HRIRA008854a:2518 
HRIRA008896a:2519 
HRIRA008958a : 2520 
HRIRA009551a:2521 
HRIRA009828a : 2522 
HRIRA010472a : 2523 
HRIRA012442a : 2524 
HRIRA012921a : 2525 
HRIRA013325a:2526 
HRIRA013644a : 2527
HRIRA013675a : 2528
HRIRA013702a : 2529
HRIRA013757a : 2530
HRIRA013951a : 2531 
HRIFA032701a : 2453
HRIFA032801a : 2454 
HRIFA033384a : 2455 
HRIFA033682a : 2456 
HRIFA033930a : 2457 
HRIFA034817a : 2458 
HRIFA035409a : 2459 
HRIFA035542a : 2460 
HRIFA035577a : 2461 
HRIFA036630a : 2462 



HRIRA014256a : 2532 
HRIRA014380a : 2533 
HRIRA015831a:2534 
HRIRA015904a:2535 
HRIRA016124a:2536 
HRIRA017071a:2537 
HRIRA018191a:2538 
HRIRA020304a : 2539
HRIRAO0()579a : 2540 



[0286] The internal sequences include ESTs, HRIFA (the representative sequence of the 5'-end), and HRIRA (the
representative sequence of the 3'-end).

Table 370



clone   name of the full-length nucleotide sequence   SEQ ID NO of the full-length nucleotide sequence   SEQ ID NO of the deduced amino acid sequence



rLbMBA 1 OOOOOo 


f-y T_T~C\ /TD A 1 AAAAAiC 

C-rLbMBAlOOOOOo 


2547 


2548 


nnnnnnnnnnnn C- 


nnnnnnnnnnnn nnnn 


nnnn 




HEMBA1000121 C-HEMBA1000121 2551 2552

TTT7X jrr\ A 1 AAA 1 OO 

HhMB A 1 000 1 2 o 


/"* TTT?\ /TD A 1 AAA \**)Q 

C-H£MBA 1 000 1 2o 


2553 


2554 


HEMBA1000275 C-HEMBA1000275 2555 2556


TTn\AT> A 1 AAAO AA 

HcMd A 1 OUUiCXJ 


ttt-'a rn a 1 AAA 2 AA 

C-riEMB A 1UUU JUU 


2557 




TTPN/TR A 1 000^40 
IUD1VJLD/V 1 \J\JUJHy 


C HPMR A 1 000^40 


9^8 


9^Q 


IXDTV/TD A 1 AAA/1 A 1 

nhMoA 1 UUU443 




25oU 


25ol 


UTC A/TO A 1 AAACQA 

JrLbMrJ A 1 UUU 5V U 


O tTCX/TD A 1 AAACQA 


2502 


200.5 


ribMxS A 1 UUUo 34 


r 1 ITCX/TO A 1 AAA/C1/I 

U-i±bJVl±>A 1 UUUO J4 


2504 


Z5o5 


nxiJVlDAlUUU/ i j 




9^A£ 


2 JO/ 


rlJClYLD/V 1 UUU / H-J 


r* WTp\/fR a 1 (\r\r\iAs 

V^-IxClVLD J\ I UUU / *rD 


9^£ft 
ZOOo 


9^£Q 
ZJ07 


WF7V/TR A 1 DO AQH7 
XIXjIYLOA I UUUi7U / 


^-JXDlVlD/ilUUU^U / 


Zj /U 


9^71 
Homology search result 1

[0287] Results of the homology search against SwissProt using the representative sequences of the 5'-ends.
Each entry lists, from the top:

the name of the representative sequence of the cluster, 
definition of the top hit data, 

the P-value: the length of the sequence used for comparison (nucleotide):similarity (%), 
the organism of which the top hit data is obtained, 
the Accession No. of the top hit data. 
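
The five-line entry layout described above can be read mechanically. The following is a minimal sketch, assuming each entry occupies exactly five lines in the listed order; the `Hit` record and the `parse_hit` function are hypothetical names introduced for illustration, not part of the specification. The sample entry is one of the records listed in this section.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    cluster: str      # name of the representative sequence of the cluster
    definition: str   # definition of the top hit
    p_value: float
    length: int       # length of the sequence used for comparison
    similarity: int   # percent
    organism: str
    accession: str


def parse_hit(lines):
    """Parse one five-line entry, top to bottom as listed above."""
    cluster, definition, stats, organism, accession = (s.strip() for s in lines)
    # Tolerate stray spaces inside the "P-value:length:similarity" field.
    p, length, similarity = stats.replace(" ", "").split(":")
    return Hit(cluster, definition, float(p), int(length), int(similarity),
               organism, accession)


entry = [
    "HRIFA029932a",
    "F-SPONDIN PRECURSOR.",
    "9.1e-24:191:37",
    "XENOPUS LAEVIS (AFRICAN CLAWED FROG).",
    "P35447",
]
hit = parse_hit(entry)
print(hit.accession, hit.p_value, hit.length, hit.similarity)
```

Entries whose definition wraps over more than one line would need to be joined before being passed to this sketch.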
[0288] Homology search results against the SwissProt database are shown only for those representative
sequences of the 5'-end clusters from which clones were selected on the basis of the
homology search results.

[0289] The P-value is a score determined by taking into account the statistical probability that the similarity
between the two sequences occurs by chance; in general, a lower value reflects higher similarity (Altschul, S.F., Gish, W., Miller, W., Myers,
E.W. & Lipman, D.J. (1990) "Basic local alignment search tool." J. Mol. Biol. 215:403-410; Gish, W. & States, D.J.
(1993) "Identification of protein coding regions by database similarity search." Nature Genet. 3:266-272).
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9.2e-05:1 78:32 

PHASEOLUS VULGARIS (KIDNEY BEAN) (FRENCH BEAN).

CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS).
5.8e-05:194:29
PLASMODIUM SIMIUM.
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HYPOTHETICAL 68.7 KD PROTEIN ZK757.1 IN CHROMOSOME III. 
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HYPOTHETICAL 64.8 KD PROTEIN IN GDI1-COX15 INTERGENIC REGION. 
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SUS SCROFA (PIG). 
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1.2e-27: 144:45 

SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
P28829 

HRIFA029932a
F-SPONDIN PRECURSOR.
9.1e-24:191:37
XENOPUS LAEVIS (AFRICAN CLAWED FROG).
P35447

HRIFA030025a
ENDOSOMAL P24A PROTEIN PRECURSOR (70 KD ENDOMEMBRANE PROTEIN) (PHEROMONE ALPHA-FACTOR TRANSPORTER) (ACIDIC 24 KD LATE ENDOCYTIC INTERMEDIATE COMPONENT).
1.0e-11:138:31
SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
P32802

HRIFA030045a
SARCALUMENIN PRECURSOR.
2.4e-20:151:32
ORYCTOLAGUS CUNICULUS (RABBIT).
P13666
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HRIFA032642a 

PROLINE-RICH PROTEIN MP-2 PRECURSOR.
5.0e-05:127:33
MUS MUSCULUS (MOUSE).
P05142

HRIFA032696a
COLLAGEN ALPHA 1(II) CHAIN (FRAGMENTS).
1.4e-13:200:38
BOS TAURUS (BOVINE).
P02459

HRIFA032730a
K-GLYPICAN PRECURSOR.
4.8e-67:180:68
MUS MUSCULUS (MOUSE).
P51655

HRIFA032820a
GLUTAMIC ACID-RICH PROTEIN PRECURSOR.
7.5e-05:192:23
PLASMODIUM FALCIPARUM (ISOLATE FC27 / PAPUA NEW GUINEA).
P13816

Homology search result 2

[0290] Homology of the representative sequences of the 5'-end clusters to the data in the SwissProt database.

[0291] Representative sequences of the 5'-end clusters exhibiting relatively high homology (304 clusters; "exhibiting relatively high homology" means that the P-value is 10^-10 or less):

HRIFA000327a, HRIFA000432a, HRIFA000553a, HRIFA000564a, HRIFA000631a, HRIFA000683a, HRIFA000776a,
HRIFA000814a, HRIFA001132a, HRIFA001138a, HRIFA001337a, HRIFA001341a, HRIFA001489a, HRIFA001712a,
HRIFA001720a, HRIFA001942a, HRIFA001975a, HRIFA001984a, HRIFA002384a, HRIFA002503a, HRIFA002743a,
HRIFA002766a, HRIFA002805a, HRIFA002891a, HRIFA002919a, HRIFA002980a, HRIFA003063a, HRIFA003093a,
HRIFA003635a, HRIFA004006a, HRIFA004034a, HRIFA004112a, HRIFA004426a, HRIFA004490a, HRIFA004523a,
HRIFA004663a, HRIFA004696a, HRIFA004714a, HRIFA004745a, HRIFA004919a, HRIFA005184a, HRIFA005231a,
HRIFA005240a, HRIFA005271a, HRIFA005372a, HRIFA005392a, HRIFA005409a, HRIFA005420a, HRIFA005438a,
HRIFA005462a, HRIFA005644a, HRIFA005720a, HRIFA005732a, HRIFA005760a, HRIFA005781a, HRIFA006183a,
HRIFA006494a, HRIFA006510a, HRIFA006566a, HRIFA006586a, HRIFA006596a, HRIFA006649a, HRIFA006667a,
HRIFA006730a, HRIFA006926a, HRIFA007013a, HRIFA007219a, HRIFA007228a, HRIFA007243a, HRIFA007352a,
HRIFA007424a, HRIFA007435a, HRIFA007463a, HRIFA007493a, HRIFA007571a, HRIFA007659a, HRIFA007722a,
HRIFA007745a, HRIFA008000a, HRIFA008200a, HRIFA008284a, HRIFA008314a, HRIFA008362a, HRIFA008459a,
HRIFA008483a, HRIFA008547a, HRIFA008611a, HRIFA008661a, HRIFA008717a, HRIFA008784a, HRIFA008981a,
HRIFA009101a, HRIFA009171a, HRIFA009220a, HRIFA009451a, HRIFA009482a, HRIFA009783a, HRIFA009881a,
HRIFA010085a, HRIFA010090a,
HRIFA010130a, HRIFA010319a, HRIFA010394a, HRIFA010460a, HRIFA010790a, HRIFA010975a, HRIFA011016a,
HRIFA011179a, HRIFA011197a, HRIFA011449a, HRIFA011659a, HRIFA011947a, HRIFA012278a, HRIFA012584a,
HRIFA012625a, HRIFA012692a, HRIFA012795a, HRIFA012885a, HRIFA012914a, HRIFA012969a, HRIFA012990a,
HRIFA013254a, HRIFA013265a, HRIFA013276a, HRIFA013376a, HRIFA013477a, HRIFA013586a, HRIFA013726a,
HRIFA013744a, HRIFA013911a, HRIFA014006a, HRIFA014185a, HRIFA014336a, HRIFA014465a, HRIFA014500a,
HRIFA014561a, HRIFA014568a, HRIFA014621a, HRIFA014688a, HRIFA014819a, HRIFA014951a, HRIFA014967a,
HRIFA015063a, HRIFA015070a, HRIFA015246a, HRIFA015423a, HRIFA015453a, HRIFA015486a, HRIFA015506a,
HRIFA015536a, HRIFA015547a, HRIFA015568a, HRIFA015756a, HRIFA015811a, HRIFA016070a, HRIFA016290a,
HRIFA016430a, HRIFA016654a, HRIFA016758a, HRIFA017031a, HRIFA017257a, HRIFA017295a, HRIFA017312a,
HRIFA017703a, HRIFA017855a, HRIFA018092a, HRIFA018131a, HRIFA018134a, HRIFA018580a, HRIFA018827a,
HRIFA018904a, HRIFA018993a, HRIFA019105a, HRIFA019136a, HRIFA019175a, HRIFA019262a, HRIFA019466a,
HRIFA019867a, HRIFA019869a, HRIFA020272a, HRIFA020335a, HRIFA020349a, HRIFA020862a, HRIFA021213a,
HRIFA021398a, HRIFA021499a, HRIFA021637a, HRIFA021651a, HRIFA021754a, HRIFA021781a, HRIFA022065a,
HRIFA022139a, HRIFA022166a, HRIFA022177a, HRIFA022182a, HRIFA022227a, HRIFA022249a, HRIFA022265a,
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HRIFA022328a, HRIFA022423a, HRIFA022528a, HRIFA022546a, HRIFA022564a, HRIFA022616a, HRIFA022671a, 
HRIFA022691a, HRIFA022707a, HRIFA022729a, HRIFA022737a, HRIFA022776a, HRIFA022875a, HRIFA022895a,
HRIFA023007a, HRIFA023227a, HRIFA023257a, HRIFA023304a, HRIFA023464a, HRIFA023767a, HRIFA023923a,
HRIFA024132a, HRIFA024255a, HRIFA024392a, HRIFA024423a, HRIFA024504a, HRIFA024718a, HRIFA024767a,
HRIFA024937a, HRIFA024994a, HRIFA025046a, HRIFA025250a, HRIFA025261a, HRIFA025353a, HRIFA025492a,
HRIFA025636a, HRIFA025695a, HRIFA025706a, HRIFA025766a, HRIFA025800a, HRIFA025907a, HRIFA025913a,
HRIFA026089a, HRIFA026364a, HRIFA026496a, HRIFA026789a, HRIFA026813a, HRIFA026860a, HRIFA027012a,
HRIFA027045a, HRIFA027125a, HRIFA027179a, HRIFA027187a, HRIFA027622a, HRIFA027625a, HRIFA027656a,
HRIFA027681a, HRIFA027722a, HRIFA027940a, HRIFA028157a, HRIFA028402a, HRIFA028468a, HRIFA028511a,
HRIFA028651a, HRIFA028790a, HRIFA029002a, HRIFA029208a, HRIFA029209a, HRIFA029256a, HRIFA029263a,
HRIFA029285a, HRIFA029317a, HRIFA029327a, HRIFA029393a, HRIFA029511a, HRIFA029802a, HRIFA029866a,
HRIFA029932a, HRIFA030025a, HRIFA030045a, HRIFA030250a, HRIFA030342a, HRIFA030370a, HRIFA030371a,
HRIFA030411a, HRIFA030448a, HRIFA030545a, HRIFA030629a, HRIFA030642a, HRIFA030662a, HRIFA031336a,
HRIFA031869a, HRIFA031986a, HRIFA032009a, HRIFA032011a, HRIFA032070a, HRIFA032073a, HRIFA032079a,
HRIFA032224a, HRIFA032274a, HRIFA032275a, HRIFA032433a,
HRIFA032453a, HRIFA032605a, HRIFA032696a, HRIFA032730a,

Homology search result 3 

[0292] Representative sequences of the 5'-end clusters exhibiting relatively low homology (221 clusters; "exhibiting relatively low homology" means that the P-value is higher than 10^-10 and 10^-4 or less):
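
The two cut-offs stated in the text (a P-value of 10^-10 or less for "relatively high homology", and a P-value higher than 10^-10 but 10^-4 or less for "relatively low homology") can be sketched as a small classifier. This is only an illustration under stated assumptions: the function name `homology_class` and the label "unclassified" for P-values above 10^-4 are hypothetical, since this passage does not define that range.

```python
HIGH_HOMOLOGY_MAX_P = 1e-10   # "relatively high homology": P <= 10^-10
LOW_HOMOLOGY_MAX_P = 1e-4     # "relatively low homology": 10^-10 < P <= 10^-4


def homology_class(p_value: float) -> str:
    """Assign a P-value to one of the homology groups named in the text.

    Assumption: values above 10^-4 are labeled "unclassified" here
    because their grouping is not defined in this passage.
    """
    if p_value <= HIGH_HOMOLOGY_MAX_P:
        return "relatively high"
    if p_value <= LOW_HOMOLOGY_MAX_P:
        return "relatively low"
    return "unclassified"


if __name__ == "__main__":
    # P-values taken from entries in homology search result 1.
    for clone, p in [("HRIFA029932a", 9.1e-24), ("HRIFA032642a", 5.0e-05)]:
        print(clone, homology_class(p))
```

With these thresholds, HRIFA029932a (P-value 9.1e-24) falls in the relatively high group and HRIFA032642a (P-value 5.0e-05) in the relatively low group, which matches the lists in which those clusters appear.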

HRIFA000016a, HRIFA000071a, HRIFA000116a, HRIFA000123a, HRIFA000264a, HRIFA000415a, HRIFA000446a,
HRIFA000695a, HRIFA000845a, HRIFA001971a, HRIFA002063a, HRIFA002102a, HRIFA002284a, HRIFA002309a,
HRIFA002694a, HRIFA002762a, HRIFA002787a, HRIFA003055a, HRIFA003340a, HRIFA003402a, HRIFA003504a,
HRIFA003892a, HRIFA003946a, HRIFA004162a, HRIFA004401a, HRIFA004780a, HRIFA005072a, HRIFA005102a,
HRIFA005214a, HRIFA005255a, HRIFA005300a, HRIFA005369a, HRIFA005702a, HRIFA005728a, HRIFA005944a,
HRIFA006298a, HRIFA006448a, HRIFA006572a, HRIFA006633a, HRIFA006642a, HRIFA007068a, HRIFA007244a,
HRIFA007262a, HRIFA007512a, HRIFA007532a, HRIFA007565a, HRIFA007728a, HRIFA007909a, HRIFA008174a,
HRIFA008426a, HRIFA008596a, HRIFA008790a, HRIFA008989a, HRIFA009578a, HRIFA009825a, HRIFA009852a,
HRIFA009983a, HRIFA010005a, HRIFA010078a, HRIFA010152a, HRIFA010301a, HRIFA010361a, HRIFA010425a,
HRIFA010466a, HRIFA010799a, HRIFA011580a, HRIFA011820a, HRIFA012167a, HRIFA012354a, HRIFA012427a,
HRIFA012436a, HRIFA012515a, HRIFA012702a, HRIFA012737a, HRIFA013135a, HRIFA013235a, HRIFA013279a,
HRIFA013589a, HRIFA013620a, HRIFA013919a, HRIFA013932a, HRIFA014056a, HRIFA014111a, HRIFA014133a,
HRIFA014396a, HRIFA014397a, HRIFA014598a, HRIFA014702a, HRIFA014868a, HRIFA015219a, HRIFA015995a,
HRIFA016214a, HRIFA016240a, HRIFA016255a, HRIFA016639a, HRIFA016669a, HRIFA016963a, HRIFA017457a,
HRIFA017643a, HRIFA017670a,
HRIFA017801a, HRIFA017836a, HRIFA017921a, HRIFA018238a, HRIFA018262a, HRIFA018287a, HRIFA018666a,
HRIFA018688a, HRIFA018754a, HRIFA018794a, HRIFA018870a, HRIFA018931a, HRIFA019412a, HRIFA019490a,
HRIFA019498a, HRIFA019532a, HRIFA019651a, HRIFA020144a, HRIFA020184a, HRIFA020453a, HRIFA020693a,
HRIFA020707a, HRIFA020748a, HRIFA021061a, HRIFA021224a, HRIFA021494a, HRIFA021794a, HRIFA021855a,
HRIFA021906a, HRIFA022156a, HRIFA022203a, HRIFA022234a, HRIFA022702a, HRIFA022728a, HRIFA022782a,
HRIFA022865a, HRIFA022890a, HRIFA022985a, HRIFA023048a, HRIFA023069a, HRIFA023129a, HRIFA023154a,
HRIFA023212a, HRIFA023489a, HRIFA023634a, HRIFA023894a, HRIFA024088a, HRIFA024197a, HRIFA024218a,
HRIFA024473a, HRIFA024482a, HRIFA024543a, HRIFA025327a, HRIFA025479a, HRIFA025488a, HRIFA025703a,
HRIFA025771a, HRIFA025778a, HRIFA025904a, HRIFA025966a, HRIFA025978a, HRIFA026121a, HRIFA026242a,
HRIFA026316a, HRIFA026382a, HRIFA026465a, HRIFA026519a, HRIFA026564a, HRIFA026576a, HRIFA026618a,
HRIFA026659a, HRIFA026764a, HRIFA027327a, HRIFA027329a, HRIFA027355a, HRIFA027644a, HRIFA027673a,
HRIFA027714a, HRIFA027860a, HRIFA028061a, HRIFA028187a, HRIFA028262a, HRIFA028371a, HRIFA028440a,
HRIFA028501a, HRIFA028576a, HRIFA028614a, HRIFA028911a, HRIFA029050a, HRIFA029278a, HRIFA029349a,
HRIFA029425a, HRIFA029434a, HRIFA029460a, HRIFA029467a, HRIFA029508a, HRIFA029730a, HRIFA029792a,
HRIFA030103a, HRIFA030147a,
HRIFA030264a, HRIFA030381a, HRIFA030456a, HRIFA030509a, HRIFA030511a, HRIFA030566a, HRIFA030599a,
HRIFA031126a, HRIFA031249a, HRIFA031438a, HRIFA031935a, HRIFA032257a, HRIFA032360a, HRIFA032389a,
HRIFA032478a, HRIFA032506a, HRIFA032511a, HRIFA032530a, HRIFA032587a, HRIFA032642a, HRIFA032820a,

Homology search result 4 

[0293] Representative sequences of the 5'-end clusters exhibiting low homology (115 clusters; "exhibiting low homology"
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P00989 

F-HEMBB1000567
HISTIDINE-RICH GLYCOPROTEIN PRECURSOR.
5.0e-05:131:29
PLASMODIUM LOPHURAE.
P04929

F-HEMBB1000642
BASIC PROLINE-RICH PEPTIDE IB-1.
0.0074:66:31
HOMO SAPIENS (HUMAN).
P04281

F-HEMBB1000668
VEGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1.
7.3e-10:184:32
PODOSPORA ANSERINA
Q00808

F-HEMBB1000679
TRAM PROTEIN (TRANSLOCATING CHAIN-ASSOCIATING MEMBRANE PROTEIN).
9.5e-73:204:69
CANIS FAMILIARIS (DOG).
Q01685

F-HEMBB1000881
F-SPONDIN PRECURSOR.
1.2e-23:191:37
XENOPUS LAEVIS (AFRICAN CLAWED FROG).
P35447

F-HEMBB1000905
TRANSCRIPTIONAL REPRESSOR RCO-1.
0.068:105:34
NEUROSPORA CRASSA.
P78706

F-HEMBB1001026
ENDOSOMAL P24A PROTEIN PRECURSOR (70 KD ENDOMEMBRANE PROTEIN) (PHEROMONE ALPHA-FACTOR TRANSPORTER) (ACIDIC 24 KD LATE ENDOCYTIC INTERMEDIATE COMPONENT).
1.3e-11:138:31
SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
P32802

F-HEMBB1001048
SARCALUMENIN PRECURSOR.
3.1e-20:151:32
ORYCTOLAGUS CUNICULUS (RABBIT).
P13666

F-HEMBB1001200
HYPOTHETICAL GENE 51 MEMBRANE PROTEIN.
1.0:66:27
ICTALURID HERPESVIRUS 1 (CHANNEL CATFISH VIRUS) (CCV).
Q00135

F-HEMBB1001407 
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F-Y79AA1 002378 

ZINC FINGER PROTEIN 38 (ZFP-38) (CTFIN51) (TRANSCRIPTION FACTOR RU49).
1.0e-59:163:74
MUS MUSCULUS (MOUSE).
Q07231

F-Y79AA1002381
CELL DIVISION CONTROL PROTEIN 28 (EC 2.7.1.-).
9.5e-41:179:38
SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
P00546

Homology search result 6 

[0296] The results of the homology search against GenBank (http://www.ncbi.nlm.nih.gov/web/GenBank/), excluding EST and STS sequences, using the clone sequences of the 5'-ends.

Listed for each entry, from the top, are:
the name of the clone sequence,
the definition of the top hit data,
the P-value:the length of the sequence used for comparison (nucleotide):similarity (%),
the Accession No. of the top hit data.

[0297] Data are not shown for clones in which the P-value was higher than 1.

F-BNGH41000020
H. sapiens mitochondrial DNA, complete genome.
6.0e-188:913:97
X93334

F-BNGH41000087
Human DNA sequence from clone 1049G16 on chromosome 20q12-13.2 Contains gene similar to GLUCOSAMINE-6-SULFATASE, a nuclear receptor coactivator gene, ESTs, STSs, GSSs, complete sequence.
7.1e-32:176:99
AL034418

F-BNGH41000091
Homo sapiens potassium channel h-eag.
1.6e-79:687:76
AJ001366

F-HEMBA1000006
S.erythraea second and third ORF's of eryA gene, complete cds.
0.95:243:64
M63677

F-HEMBA1000121
Human DNA sequence *** SEQUENCING IN PROGRESS *** from clone 796F18, WORKING DRAFT SEQUENCE.
5.9e-70:450:89
AL031291

F-HEMBA1000128
Plasmodium falciparum DNA *** SEQUENCING IN PROGRESS *** from contig 3-14, complete sequence.
1.0:274:59
Z98549

F-HEMBA1000275
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F-HEMBB1 000106 

Human DNA sequence from clone 1170D6 on chromosome Xq22.3-23. Contains a pseudogene similar to U-SNRNP_associated Cyclophilin (USA-CYP, EC 5.2.1.8), ESTs, an STS and a GSS, complete sequence.
0.033:332:61
AL030995

F-HEMBB1000276
Dictyostelium discoideum gene encoding a novel glycoprotein.
0.00070:440:60
AJ005262

F-HEMBB1000309
Homo sapiens zinc finger protein (MBLL) mRNA, complete cds.
7.6e-34:180:100
AF061261

F-HEMBB1000407
Human Chromosome 11 pac pDJ393o15, WORKING DRAFT SEQUENCE, 8 unordered pieces.
0.16:228:64
AC000384

F-HEMBB1000447
Homo sapiens JWA protein mRNA, complete cds.
1.4e-158:750:98
AF070523

F-HEMBB1000542
Human DNA sequence from PAC 509L4 on chromosome 6q22.1-6q22.33. Contains SSX3 like pseudogene, EST, STS.
4.3e-141:874:89
Z99496

F-HEMBB1000567
Human DNA for insulin-like growth factor II (IGF-2); exon 7 and additional ORF.
9.7e-122:572:99
X07868

F-HEMBB1000642

F-HEMBB1000668
Caenorhabditis elegans cosmid K06A5.
0.00041:174:64
AF039038

F-HEMBB1000679
C.familiaris mRNA for TRAM-protein.
6.1e-100:756:80
X63678

F-HEMBB1000881
Danio rerio mRNA for MINDIN2, complete cds.
6.2e-40:581:66
AB006085

F-HEMBB1000905
Homo sapiens clone RG315H11, WORKING DRAFT SEQUENCE, 5 unordered pieces.
4.9e-91:209:94
AC005089
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1.1 e-1 32:805:88 
U39045 

F-Y79AA1002378
Mus musculus mRNA for zinc finger protein, complete cds, clone:CTfin51.
1.9e-64:521:78
D10630

F-Y79AA1002381
O.sativa mRNA for cdc2+/CDC28-related protein kinase.
3.3e-21:431:60
X58194

Homology search result 7 

[0298] The results of the homology search against GenBank (http://www.ncbi.nlm.nih.gov/web/GenBank/), excluding EST and STS sequences, using the clone sequences of the 3'-ends.

Listed for each entry, from the top, are:
the name of the clone sequence,
the definition of the top hit data,
the P-value:the length of the sequence used for comparison (nucleotide):similarity (%),
the Accession No. of the top hit data.

[0299] A blank indicates that the 3'-end sequence corresponding to the 5'-end was not determined for the clone. Data are not shown for clones in which the P-value was higher than 1.
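
The entry layout described above (clone name, definition, score field, accession number, with a name-only entry where no hit is reported) can be read with a short Python sketch. The names `GenBankHit` and `parse_entries` are hypothetical, and the sketch assumes that entries are separated by blank lines and that each field occupies exactly one line; text that wraps a definition across lines would need to be rejoined first.

```python
import re
from typing import List, NamedTuple, Optional


class GenBankHit(NamedTuple):
    """One listing entry; hit fields are None when only the name is given."""
    clone: str
    definition: Optional[str]
    score: Optional[str]        # raw 'P-value:length:similarity' field
    accession: Optional[str]


def parse_entries(text: str) -> List[GenBankHit]:
    """Split a listing into entries.

    Assumptions: entries are separated by blank lines, and an entry has
    either one line (name only) or four lines (name, definition, score,
    accession).
    """
    entries = []
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [line.strip() for line in block.splitlines() if line.strip()]
        if not lines:
            continue
        if len(lines) == 1:
            entries.append(GenBankHit(lines[0], None, None, None))
        elif len(lines) == 4:
            entries.append(GenBankHit(*lines))
        else:
            raise ValueError(f"unexpected entry layout: {lines!r}")
    return entries


if __name__ == "__main__":
    sample = (
        "R-HEMBA1000006\n\n"
        "R-HEMBA1000128\n"
        "Homo sapiens chromosome X, PAC 671D9, complete sequence.\n"
        "0.99:389:60\n"
        "AF031078\n"
    )
    for entry in parse_entries(sample):
        print(entry)
```

Raising an error on any other line count is a deliberate choice in this sketch: it surfaces damaged entries rather than misassigning their fields.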

R-HEMBA1000006

R-HEMBA1000121
Human DNA sequence *** SEQUENCING IN PROGRESS *** from clone 796F18, WORKING DRAFT SEQUENCE.
2.2e-43:355:80
AL031291

R-HEMBA1000128
Homo sapiens chromosome X, PAC 671D9, complete sequence.
0.99:389:60
AF031078

R-HEMBA1000275
Homo sapiens DNA sequence from PAC 958B3 on chromosome Xp22.11-Xp22.22. Contains ESTs STS and CpG island.
3.4e-10:212:66
Z93023

R-HEMBA1000300
{Alu RNA transcript, clone NE461} [human, embryonal carcinoma cells, NTera2D1 pluripotential cells, Other RNA, 282 nt].
4.6e-42:246:89
S42653

R-nnnnnnnnnnnn
Homo sapiens chromosome 17, clone hRPK.235_l_10, complete sequence.
1.0e-71:192:95
AC005922

R-HEMBA1000462
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Homo sapiens JWA protein mRNA, complete cds. 

1.7e-107:533:97
AF070523

R-HEMBB1000542
Mus musculus bromodomain-containing protein BP75 mRNA, complete cds.
4.4e-72:547:80
AF084259

R-HEMBB1000567
Human insulin-like growth factor (IGF-II) gene, exon 1 of 4.
4.3e-60:368:88
M13970

R-HEMBB1000642
Human DNA sequence from PAC 46H23, BRCA2 gene region chromosome 13q12-13 contains Klotho, ESTs.
2.9e-42:431:75
Z84483

R-HEMBB1000668
CITBI-E1-2508D15.TR CITBI-E1 Homo sapiens genomic clone 2508D15, genomic survey sequence.
2.5e-40:249:91
AQ261535

R-HEMBB1000679
HS_3061_A1_C03_T7 CIT Approved Human Genomic Sperm Library D Homo sapiens genomic clone Plate=3061 Col=5 Row=E, genomic survey sequence.
1.8e-48:257:96
AQ127602

R-HEMBB1000881
CIT-HSP-2350020.TR CIT-HSP Homo sapiens genomic clone 2350020, genomic survey sequence.
0.0072:248:61
AQ062620

R-HEMBB1000905
Homo sapiens clone RG315H11, WORKING DRAFT SEQUENCE, 5 unordered pieces.
2.5e-104:547:94
AC005089

R-HEMBB1001026

R-HEMBB1001048

R-HEMBB1001200
P.falciparum complete gene map of plastid-like DNA (IR-A).
1.5e-11:521:59
X95275

R-HEMBB1001407
Human DNA sequence *** SEQUENCING IN PROGRESS *** from clone 2705, WORKING DRAFT SEQUENCE.
3.0e-29:308:77
AL033529

R-HEMBB1001530
Homo sapiens chromosome 19, cosmid R30538, complete sequence.
0.040:373:63
AC005943
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Homology search result 8. 

[0300] The results of the homology search against Human UniGene (http://www.ncbi.nlm.nih.gov/UniGene) using the clone sequences of the 5'-ends.

Listed for each entry, from the top, are:
the name of the clone sequence,
the title of the top hit data,
the P-value:the length of the sequence used for comparison (nucleotide):similarity (%),
the Accession No. of the top hit data.

[0301] Data are not shown for clones in which the P-value was higher than 1.

F-BNGH41000020
ESTs
6.6e-72:412:92
Hs.153375:AI287812

F-BNGH41000087
Homo sapiens mRNA for MIFR-1, complete cds
0.027:499:57
Hs.58269:AB010962

F-BNGH41000091
Homo sapiens voltage-gated potassium channel eag (EAG) mRNA, complete cds
5.2e-81:687:76
Hs.158305:AJ001366

F-HEMBA1000006
ESTs, Weakly similar to HYPOTHETICAL 51.2 KD PROTEIN IN LAG1-RPL14B INTERGENIC REGION [S.cerevisiae]
2.0e-25:167:91
Hs.9252:R53360

F-HEMBA1000121
ESTs, Moderately similar to HYPOTHETICAL 68.7 KD PROTEIN ZK757.1 IN CHROMOSOME III [Caenorhabditis elegans]
3.0e-34:180:98
Hs.149509:N24022

F-HEMBA1000128
EST
0.00069:177:62
Hs.158854:AI377837

F-HEMBA1000275
Human modulator recognition factor I (MRF-1) mRNA, 3'end
0.012:508:58
Hs.920:M62324

F-HEMBA1000300
Human mRNA for KIAA0355 gene, complete cds
1.6e-46:402:78
Hs.153014:AB002353

F-HEMBA1000349
EST
6.7e-08:65:95
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F-HEMBB1 000567 
ESTs
8.8e-13:271:71
Hs.19934:AA455673

F-HEMBB1000642

F-HEMBB1000668
EST
0.83:192:58
Hs.126372:AA912193

F-HEMBB1000679
H.sapiens mRNA for TRAMP protein
4.1e-96:727:80
Hs.4147:X63679

F-HEMBB1000881
Homo sapiens chromosome 4p homeobox mRNA sequence
2.2e-06:512:60
Hs.104134:M99587

F-HEMBB1000905
Homo sapiens mRNA for voltage gated potassium channel
0.93:337:58
Hs.4975:Y15065

F-HEMBB1001026
Human p76 mRNA, complete cds
6.1e-08:410:61
Hs.28757:U81006

F-HEMBB1001048
Human Hpast (HPAST) mRNA, complete cds
2.1e-56:524:75
Hs.155119:AF001434

F-HEMBB1001200
EST
0.10:300:61
Hs.161647:AA133367

F-HEMBB1001407
Homo sapiens PRKY exon 1 and joined CDS
2.6e-40:271:81
Hs.56336:Y15801

F-HEMBB1001530
ESTs
1.2e-98:477:98
Hs.135208:AI093908

F-HEMBB1001547

F-HEMBB1001573
EST
2.2e-06:115:75
Hs.138275:R43976
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B94 PROTEIN 
5.7e-13:469:65 
Hs.75522:M92357 

F-Y79AA1002058
Homo sapiens clone 24733 mRNA sequence
1.7e-154:740:98
Hs.21970:AF052149

F-Y79AA1002121
EST
0.14:104:66
Hs.100070:M91493

F-Y79AA1002129
ESTs
5.1e-90:431:98
Hs.40719:AI183452

F-Y79AA1002213

F-Y79AA1002334
ESTs
5.0e-20:187:80
Hs.111900:AA397579

F-Y79AA1002373
ESTs
4.5e-37:192:98
Hs.118559:AA887084

F-Y79AA1002376
Homo sapiens cytoplasmic dynein intermediate chain 1 mRNA, complete cds
1.2e-36:657:64
Hs.65248:AF063228

F-Y79AA1002378
Homo sapiens KIAA0426 mRNA, complete cds
4.9e-38:424:72
Hs.97476:AB007886

F-Y79AA1002381
CELL DIVISION PROTEIN KINASE 3
8.4e-17:580:61
Hs.100009:X66357

Homology search result 9 

[0302] The results of the homology search against Human UniGene (http://www.ncbi.nlm.nih.gov/UniGene) using the clone sequences of the 3'-ends.

Listed for each entry, from the top, are:
the name of the clone sequence,
the title of the top hit data,
the P-value:the length of the sequence used for comparison (nucleotide):similarity (%),
the Accession No. of the top hit data.

[0303] A blank indicates that the 3'-end sequence corresponding to the 5'-end was not determined for the clone.
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R-HEMBB1 000567 

Insulin-like growth factor 2 (somatomedin A)
8.9e-61:369:88
Hs.155487:J03242

R-HEMBB1000642
ESTs
2.2e-44:308:84
Hs.141318:N71080

R-HEMBB1000668
ESTs, Weakly similar to hTAFII100 [H.sapiens]
2.5e-102:520:95
Hs.3830:AA167691

R-HEMBB1000679
ESTs
6.7e-36:188:97
Hs.154218:AA169554

R-HEMBB1000881
ESTs
8.4e-105:519:96
Hs.110967:AA570505

R-HEMBB1000905
ESTs
1.1e-94:454:98
Hs.52515:AA464314

R-HEMBB1001026
ESTs
0.22:93:69
Hs.119510:AA630235

R-HEMBB1001048
EST
0.42:127:66
Hs.147466:AI215091

R-HEMBB1001200
ESTs
3.7e-07:330:62
Hs.10109:AI148628

R-HEMBB1001407
MHC class II transactivator
3.8e-35:414:71
Hs.3076:U18259

R-HEMBB1001530
ESTs
2.4e-95:455:98
Hs.8956:AI146421

R-HEMBB1001547
ESTs
1.0e-111:533:98
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Homo sapiens DNA recombination and repair protein (MRE11B) mRNA, complete cds 

0.00075:456:59 

Hs.153855:AF022778

R-Y79AA1002213
Human mRNA for KIAA0392 gene, partial cds
6.2e-45:304:85
Hs.40100:AB002390

R-Y79AA1002334
ESTs
7.7e-91:495:92
Hs.90804:W28091

R-Y79AA1002373
Human kpni repeat mrna (cdna clone pcd-kpni-8), 3' end
5.2e-98:545:91
Hs.103948:K00627

R-Y79AA1002376
ESTs
2.0e-91:455:97
Hs.153375:AI287812

R-Y79AA1002378
ESTs, Highly similar to ZINC FINGER PROTEIN ZFP-35 [Mus musculus]
9.4e-15:131:83
Hs.20082:W89121

R-Y79AA1002381
ESTs, Highly similar to CELL DIVISION CONTROL PROTEIN 2 HOMOLOG [Plasmodium falciparum (isolate k1/thailand)]
1.5e-104:531:95
Hs.26322:AA156858


Homology search result 10 

[0305] Data obtained by the homology search for the full-length nucleotide sequences and the deduced amino acid sequences. In the search results shown below, both aa and bp are used as length units for the compared sequences. Each entry includes the clone name, the definition of the matching data, the P-value, the length of the compared sequence, the homology, and the accession number (No.) of the matching data. These items are shown in this order, separated by a double-slash mark (//).
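
The double-slash record layout described above can be split into its fields with a short Python sketch. The names `FullLengthHit` and `parse_full_length_hit` are hypothetical and not part of the original disclosure. The sketch assumes that a record without any "//" is a clone for which no hit is listed, and it splits the last four fields from the right so that a definition containing slashes is not broken apart.

```python
from typing import NamedTuple, Optional


class FullLengthHit(NamedTuple):
    """Fields of one '//'-separated record (hypothetical container)."""
    clone: str
    definition: Optional[str] = None
    p_value: Optional[float] = None
    length: Optional[int] = None
    unit: Optional[str] = None            # "aa" or "bp"
    homology_percent: Optional[int] = None
    accession: Optional[str] = None


def parse_full_length_hit(record: str) -> FullLengthHit:
    """Parse 'clone//definition//P//length//homology//accession'.

    Assumption: a record with no '//' carries only the clone name.
    """
    if "//" not in record:
        return FullLengthHit(record.strip())
    clone, rest = record.split("//", 1)
    # Split from the right: the last four fields have a fixed meaning.
    definition, p_text, length_text, homology_text, accession = rest.rsplit("//", 4)
    length_text = length_text.strip()
    unit = length_text[-2:]
    if unit not in ("aa", "bp"):
        raise ValueError(f"unknown length unit in {length_text!r}")
    return FullLengthHit(
        clone=clone.strip(),
        definition=definition.strip(),
        p_value=float(p_text),
        length=int(length_text[:-2]),
        unit=unit,
        homology_percent=int(homology_text.strip().rstrip("%")),
        accession=accession.strip(),
    )


if __name__ == "__main__":
    print(parse_full_length_hit(
        "C-HEMBA1000443//Homo sapiens CGI-96 protein mRNA, complete cds."
        "//4.70E-129//686bp//91%//AF151854"
    ))
```

Keeping the unit as a separate field makes it easy to tell protein-level hits (aa) from nucleotide-level hits (bp) when filtering the records.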

C-HEMBA1000006//Homo sapiens mRNA; cDNA DKFZp564G1762 (from clone DKFZp564G1762).//0//1230bp//92%//AB026894
C-nnnnnnnnnnnn//GAMETOGENESIS EXPRESSED PROTEIN GEG-154.//2.30E-71//344aa//50%//P50636
C-HEMBA1000121//HYPOTHETICAL 68.7 KD PROTEIN ZK757.1 IN CHROMOSOME III.//4.80E-05//83aa//27%//P34679
C-HEMBA1000128//PATHOGENESIS-RELATED PROTEIN 1 PRECURSOR (PR-1).//3.20E-07//89aa//34%//P33154
C-HEMBA1000275
C-HEMBA1000300
C-HEMBA1000349//ATP-BINDING CASSETTE TRANSPORTER 1.//5.30E-65//352aa//39%//P41233
C-HEMBA1000443//Homo sapiens CGI-96 protein mRNA, complete cds.//4.70E-129//686bp//91%//AF151854
C-HEMBA1000590//Homo sapiens mRNA for matrilin-4, partial.//2.00E-273//1254bp//99%//AJ007581
C-HEMBA1000634//Homo sapiens T-cell activation protein (PGR1) gene, complete cds.//0//994bp//99%//AF116272
C-HEMBA1000713//Homo sapiens 10kD protein (BC10) mRNA, complete cds.//0//1254bp//99%//AF053470
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C-HEMBA1 000745//COLLAGEN ALPHA 1(1) CHAIN (FRAGMENTS).//2.00E-07//445aa//27%//P02454 
C-HEMBA1000907
C-HEMBA1000940//GAP JUNCTION ALPHA-3 PROTEIN (CONNEXIN 44) (CX44).//2.90E-39//362aa//31%//P41987
C-HEMBA1000962
C-HEMBA1001221//AGRIN PRECURSOR.//2.50E-25//294aa//29%//P31696
C-HEMBA1001228//Human germline oligomeric matrix protein (COMP) mRNA, complete cds.//7.80E-286//1105bp//94%//L32137
C-HEMBA1001297
C-HEMBA1001390//Mus musculus polymerase I-transcript release factor mRNA, complete cds.//2.50E-57//464bp//82%//AF036249
C-HEMBA1001563
C-HEMBA1001621//PROBABLE G PROTEIN-COUPLED RECEPTOR APJ.//3.50E-123//259aa//89%//P35414
C-nnnnnnnnnnnn//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.40E-85//293aa//50%//P51523
C-HEMBA1001878//Homo sapiens U5 snRNP-specific 40 kDa protein mRNA, complete cds.//0//1488bp//99%//AF090988
C-HEMBA1002131//PROCOLLAGEN-LYSINE,2-OXOGLUTARATE 5-DIOXYGENASE 1//4.10E-10//140aa//30%//P24802
C-HEMBA1002163//HYPOTHETICAL 40.7 KD PROTEIN IN DAK1-ORC1 INTERGENIC REGION.//9.40E-28//309aa//30%//Q04651
C-HEMBA1002164
C-HEMBA1002167//Rattus norvegicus neuroligin I mRNA, complete cds.//1.30E-305//1643bp//91%//U22952
C-HEMBA1002178//PROCOLLAGEN-LYSINE,2-OXOGLUTARATE 5-DIOXYGENASE 1 PRECURSOR (EC 1.14.11.4) (LYSYL HYDROXYLASE 1) (LH1).//3.70E-10//140aa//30%//P24802
C-nnnnnnnnnnnn//Human glycyl-tRNA synthetase mRNA, complete cds.//0//2380bp//99%//U09587
C-HEMBA1002195//VEGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1.//8.80E-23//221aa//31%//Q00808
C-HEMBA1002227//Homo sapiens mRNA for 80K-L protein, complete cds.//0//1324bp//98%//D10522
C-HEMBA1002239
C-HEMBA1002316
C-HEMBA1002420
C-HEMBA1002421//Human heparan sulfate proteoglycan (HSPG) core protein, 3' end.//0//2097bp//99%//J04621
C-HEMBA1002524//Human MHC Class I region proline rich protein mRNA, complete cds.//0//1763bp//95%//U63336
C-HEMBA1002551//PUTATIVE SERINE/THREONINE-PROTEIN KINASE PKWA (EC 2.7.1.-).//9.80E-08//110aa//37%//P49695
C-HEMBA1002767//Homo sapiens chromosome 1p33-p34 beta-1,4-galactosyltransferase mRNA, complete cds.//0//1497bp//99%//AF038660
C-HEMBA1002992//UBIQUITIN-LIKE PROTEIN DSK2.//2.00E-21//216aa//35%//P48510
C-HEMBA1003047//Homo sapiens intrinsic factor-B12 receptor precursor, mRNA, complete cds.//0//1768bp//99%//AF034611
C-HEMBA1003072//Gallus gallus single-strand DNA-binding protein csdp mRNA, partial cds.//3.30E-93//927bp//73%//U68380
C-HEMBA1003101//Homo sapiens tyrosylprotein sulfotransferase-2 mRNA, complete cds.//0//1854bp//99%//AF049891
C-HEMBA1003230//Homo sapiens fibulin-5.//5.60E-308//1398bp//99%//AJ133490
C-HEMBA1003294
C-HEMBA1003315//Mus musculus mRNA for DNA helicase, complete cds.//6.30E-250//1426bp//88%//AB013912
C-HEMBA1003392//Homo sapiens LDL receptor-related protein 6 (LRP6) mRNA, complete cds.//0//1721bp//100%//AF074264
C-HEMBA1003399//MVP1 PROTEIN.//2.30E-15//279aa//23%//P40959
C-HEMBA1003487
C-HEMBA1003530//S.scrofa mRNA for BM88 antigen.//1.20E-60//900bp//66%//X82027
C-HEMBA1003602//Homo sapiens CGI-67 protein mRNA, complete cds.//3.50E-70//732bp//66%//AF151825
C-HEMBA1003732//SFT2 PROTEIN.//1.50E-06//162aa//30%//P38166
C-HEMBA1003945//Homo sapiens hypothetical 43.2 Kd protein mRNA, complete cds.//8.90E-287//757bp//97%//AF077030
C-HEMBA1004110//Homo sapiens intersectin short form mRNA, complete cds.//0//2033bp//99%//AF064243
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C-HEMBA1 004250//CADH ERIN-RELATED TUMOR SUPPRESSOR PRECURSOR (FAT PROTEIN).//6.40E-51// 
277aa//35%//P33450 

C-HEMBA1004391//TUMOR SUPPRESSOR PROTEIN DCC PRECURSOR.//5.60E-20//194aa//26%//P70211
C-HEMBA1004444//GLYCOPROTEIN 25L PRECURSOR (GP25L).//4.60E-41//148aa//52%//P27869
C-HEMBA1004454
C-HEMBA1004505//MANNOSYL-OLIGOSACCHARIDE ALPHA-1,2-MANNOSIDASE ISOFORM 1 (EC 3.2.1.113) (MAN(9)-ALPHA-MANNOSIDASE).//2.70E-45//239aa//43%//P53624
C-HEMBA1004797
C-HEMBA1004982//TETRACYCLINE RESISTANCE PROTEIN, CLASS E (TETA(E)).//6.30E-10//149aa//26%//Q07282
C-HEMBA1005070//SKIN SECRETORY PROTEIN XP2 PRECURSOR (APEG PROTEIN).//1.10E-05//187aa//29%//P17437
C-HEMBA1005084//NON-RECEPTOR TYROSINE KINASE SPORE LYSIS A (EC 2.7.1.112) (TYROSINE-PROTEIN KINASE 1).//1.20E-07//102aa//37%//P18160
C-HEMBA1005145
C-HEMBA1005430
C-HEMBA1005449//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOSIDASE) (1,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//5.40E-10//224aa//24%//P13983
C-HEMBA1005489
C-HEMBA1005522//COAGULATION FACTOR VII PRECURSOR (EC 3.4.21.21).//7.70E-15//78aa//51%//P98139
C-HEMBA1005545//MUSCARINIC ACETYLCHOLINE RECEPTOR M3.//0//2121aa//100%//U29589
C-HEMBA1005698//Homo sapiens vesicle trafficking protein (SEC22C) mRNA, complete cds.//6.60E-163//753bp//99%//AF039568
C-HEMBA1005913
C-HEMBA1005929//H.sapiens mRNA for serine/threonine protein kinase EMK.//6.50E-92//1092bp//69%//X97630
C-HEMBA1005945//Oryctolagus cuniculus peroxisomal Ca-dependent solute carrier mRNA, complete cds.//1.90E-44//666bp//65%//AF004161
C-HEMBA1006016
C-HEMBA1006171
C-HEMBA1006299
C-HEMBA1006311
C-HEMBA1006335
C-HEMBA1006430//Human putative transmembrane protein precursor (B5) mRNA, complete cds.//2.40E-70//1108bp//65%//L38961
C-HEMBA1006482//Homo sapiens h-sco1 (SCO1) mRNA, nuclear gene encoding mitochondrial protein, complete cds.//0//1101bp//98%//AF026852
C-HEMBA1006572//ODD-SKIPPED PROTEIN.//2.60E-39//85aa//83%//P23803
C-HEMBA1006707//Homo sapiens mRNA for matrilin-4, partial.//0//2003bp//99%//AJ007581
C-HEMBA1006724
C-HEMBA1006902//Homo sapiens mRNA for matrilin-4, partial.//4.80E-275//1799bp//85%//AJ007581
C-HEMBA1006916//Homo sapiens Grb14 mRNA, complete cds.//3.00E-277//1010bp//95%//L76687
C-HEMBA1006960
C-HEMBA1007013
C-HEMBA1007057
C-HEMBA1007241
C-HEMBA1007291
C-HEMBA1007332
C-HEMBB1000276
C-HEMBB1000447//Homo sapiens JWA protein mRNA, complete cds.//0//2059bp//99%//AF070523
C-HEMBB1000642
C-HEMBB1000668//Homo sapiens mRNA for KIAA0893 protein, complete cds.//0//2375bp//99%//AB020700
C-HEMBB1000679//C.familiaris mRNA for TRAM-protein.//4.10E-210//1149bp//80%//X63678
C-HEMBB1000881//Danio rerio mRNA for MINDIN2, complete cds.//1.70E-67//948bp//66%//AB006085
C-HEMBB1000905//TRANSCRIPTIONAL REPRESSOR RCO-1.//1.00E-11//311aa//27%//P78706
C-HEMBB1001026//ENDOSOMAL P24A PROTEIN PRECURSOR (70 KD ENDOMEMBRANE PROTEIN) (PHEROMONE ALPHA-FACTOR TRANSPORTER) (ACIDIC 24 KD LATE ENDOCYTIC INTERMEDIATE COMPONENT).//5.30E-11//142aa//30%//P32802
C-HEMBB1001048//Human Hpast (HPAST) mRNA, complete cds.//6.50E-39//448bp//75%//AC000159
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015127 

C-HEMBA1006430//Human putative transmembrane protein precursor (B5) mRNA, complete cds.//2.40E-70//1108bp//65%//L38961
C-HEMBA1006482//Homo sapiens h-sco1 (SCO1) mRNA, nuclear gene encoding mitochondrial protein, complete cds.//0//1101bp//98%//AF026852
C-HEMBA1006517
C-HEMBA1006544
C-HEMBA1006572//ODD-SKIPPED PROTEIN.//2.60E-39//85aa//83%//P23803
C-HEMBA1006658//Homo sapiens mRNA for NIK, partial cds.//0//1500bp//98%//AB013385
C-HEMBA1006707//Homo sapiens mRNA for matrilin-4, partial.//0//2003bp//99%//AJ007581
C-HEMBA1006724
C-HEMBA1006749//Homo sapiens mRNA for matrilin-4, partial.//1.40E-275//1942bp//83%//AJ007581
C-HEMBA1006770//FLOWERING TIME CONTROL PROTEIN FCA.//1.20E-33//352aa//34%//O04425
C-HEMBA1006902//Homo sapiens mRNA for matrilin-4, partial.//4.80E-275//1799bp//85%//AJ007581
C-HEMBA1006912
C-HEMBA1006916//Homo sapiens Grb14 mRNA, complete cds.//3.00E-277//1010bp//95%//L76687
C-HEMBA1006960
C-HEMBA1007013//Mus musculus sphingosine kinase (SPHK1b) mRNA, complete cds.//1.10E-14//412bp//63%//AF068749
C-HEMBA1007057
C-HEMBA1007063
C-HEMBA1007226//Homo sapiens RPA-binding trans-activator (RBT1) mRNA, complete cds.//7.30E-273//1242bp//99%//AF192529
C-HEMBA1007241//HYPOTHETICAL 24.5 KD PROTEIN IN SAP185-BCK1 INTERGENIC REGION.//2.70E-14//106aa//42%//P40857
C-HEMBA1007291
C-HEMBA1007332//Homo sapiens mRNA for unr-interacting protein.//6.40E-83//266bp//98%//AJ010025
C-HEMBB1000106//CELLULAR NUCLEIC ACID BINDING PROTEIN (CNBP).//1.60E-10//139aa//30%//P53996
C-HEMBB1000276
C-HEMBB1000309
C-HEMBB1000407
C-HEMBB1000447//Homo sapiens JWA protein mRNA, complete cds.//0//2059bp//99%//AF070523
C-HEMBB1000542//Mus musculus bromodomain-containing protein BP75 mRNA, complete cds.//2.60E-232//1452bp//85%//AF084259
C-HEMBB1000567
C-HEMBB1000642
C-HEMBB1000668//Homo sapiens mRNA for KIAA0893 protein, complete cds.//0//2375bp//99%//AB020700
C-HEMBB1000679//C.familiaris mRNA for TRAM-protein.//4.10E-210//1149bp//80%//X63678
C-HEMBB1000881//Danio rerio mRNA for MINDIN2, complete cds.//1.70E-67//948bp//66%//AB006085
C-HEMBB1000905//TRANSCRIPTIONAL REPRESSOR RCO-1.//1.00E-11//311aa//27%//P78706
C-HEMBB1001026//ENDOSOMAL P24A PROTEIN PRECURSOR (70 KD ENDOMEMBRANE PROTEIN) (PHEROMONE ALPHA-FACTOR TRANSPORTER) (ACIDIC 24 KD LATE ENDOCYTIC INTERMEDIATE COMPONENT).//5.30E-11//142aa//30%//P32802
C-HEMBB1001048//SARCALUMENIN PRECURSOR.//6.50E-18//154aa//33%//P13666
C-HEMBB1001200
C-HEMBB1001407
C-HEMBB1001530//SLS1 PROTEIN PRECURSOR.//9.80E-10//273aa//27%//Q99158
C-HEMBB1001547//Homo sapiens CGI-02 protein mRNA, complete cds.//0//2311bp//99%//AF132937
C-HEMBB1001573
C-HEMBB1001847//NEUROGENIC PROTEIN BIG BRAIN.//4.70E-06//258aa//24%//P23645
C-HEMBB1001959//CCAAT-BINDING TRANSCRIPTION FACTOR SUBUNIT A (CBF-A) (NF-Y PROTEIN CHAIN B) (NF-YB) (CAAT-BOX DNA BINDING PROTEIN SUBUNIT B).//7.30E-14//97aa//38%//P25210
C-HEMBB1001978
C-HEMBB1002039
C-HEMBB1002041//Homo sapiens transmembrane protein TENB2 (TENB2) mRNA, complete cds.//0//1746bp//99%//AF179274

C-HEMBB1002051//Homo sapiens Ets transcription factor ESE-2b mRNA, complete cds.//1.30E-95//454bp//99%//AF115403
NASE 1).//1.00E-77//359aa//44%//Q14012
C-Y79AA1001013
C-Y79AA1001056//Homo sapiens MAID protein mRNA, complete cds.//0//1475bp//99%//AF113535
C-Y79AA1001062//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 1, ENDOTHELIAL (B12 PROTEIN).//8.90E-12//132aa//38%//Q13829
C-Y79AA1001090//NUCLEAR FACTOR NF-KAPPA-B P105 SUBUNIT (DNA-BINDING FACTOR KBF1) (EBP-1) (NF-KAPPA-B1 P84/NF-KAPPA-B1 P98) [CONTAINS: NUCLEAR FACTOR NF-KAPPA-B P50 SUBUNIT] (FRAGMENT).//4.50E-09//144aa//31%//Q63369
C-Y79AA1001212//Homo sapiens SL15 protein mRNA, complete cds.//6.30E-306//1388bp//99%//AF038961
C-Y79AA1001264//HYPOTHETICAL 39.9 KD PROTEIN T15H9.1 IN CHROMOSOME II PRECURSOR.//5.10E-106//351aa//58%//Q10005
C-Y79AA1001272//Homo sapiens retinoic acid repressible protein (RARG-1) mRNA, complete cds.//1.50E-183//867bp//98%//AF172066
C-Y79AA1001328//Mus musculus mRNA for Dll3 protein, complete cds.//1.90E-263//1988bp//79%//AB013440
C-Y79AA1001426//ANION EXCHANGE PROTEIN 3 (CARDIAC/BRAIN BAND 3-LIKE PROTEIN) (CAE3/BAE3).//6.20E-66//609aa//31%//P48751
C-Y79AA1001427//Homo sapiens cytochrome b5 reductase 1 (B5R.1) mRNA, complete cds.//0//1588bp//99%//AF169481
C-Y79AA1001430//Homo sapiens mRNA for KIAA0469 protein, complete cds.//0//2943bp//99%//AB007938
C-Y79AA1001523//Homo sapiens transcriptional intermediary factor 1 alpha mRNA, complete cds.//0//2263bp//99%//AF119042
C-Y79AA1001530//Human beta-tubulin gene (5-beta) with ten Alu family members.//0//1920bp//98%//X00734
C-Y79AA1001592
C-Y79AA1001727//CELL SURFACE A33 ANTIGEN PRECURSOR.//1.10E-13//286aa//27%//Q99795
C-Y79AA1001787//PROBABLE CALCIUM-TRANSPORTING ATPASE 9 (EC 3.6.1.38).//1.70E-133//544aa//37%//Q12697
C-Y79AA1001793//Mus musculus mRNA for GSG1, complete cds.//3.70E-126//532bp//78%//D87325
C-Y79AA1001795//Homo sapiens mRNA for GalT4 protein.//2.30E-250//1137bp//99%//Y15061
C-Y79AA1001799//MITOCHONDRIAL RNA SPLICING PROTEIN MSR4.//3.40E-54//182aa//39%//P23500
C-Y79AA1001803//Homo sapiens secretogranin III mRNA, complete cds.//0//1871bp//99%//AF078851
C-Y79AA1001863
C-Y79AA1002022//POLIOVIRUS RECEPTOR HOMOLOG PRECURSOR.//2.20E-06//140aa//26%//P32507
C-Y79AA1002058//Mus musculus Gng3lg mRNA, complete cds.//4.10E-167//1145bp//83%//AF069954
C-Y79AA1002121//HISTONE H1.//4.90E-12//114aa//35%//P35060
C-Y79AA1002129
C-Y79AA1002213//HYPOTHETICAL 52.7 KD PROTEIN C38C10.2 IN CHROMOSOME III.//1.20E-98//262aa//41%//Q03567
C-Y79AA1002334//GLUCOSE REPRESSION MEDIATOR PROTEIN.//1.70E-10//333aa//23%//P14922
C-Y79AA1002373//Mus musculus mRNA for GSG1, complete cds.//7.20E-147//680bp//79%//D87325
C-Y79AA1002376//Rattus norvegicus cytoplasmic dynein intermediate chain 2B mRNA, complete cds.//1.50E-304//1667bp//90%//U39045
C-Y79AA1002378//Homo sapiens zinc finger protein NY-REN-21 antigen mRNA, partial cds.//0//963bp//99%//AF155100
C-Y79AA1002381//Homo sapiens cell cycle related kinase mRNA, complete cds.//0//1791bp//98%//AF035013
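The homology entries listed above share a "//"-delimited layout, with some clones listed by name only. As a rough illustration, and not as part of the disclosed method, such a record might be split into fields as sketched below. The field names (`description`, `p_value`, `length`, `unit`, `identity_pct`, `accession`) and the helper `parse_record` are assumptions introduced here for illustration; they are inferred from the visible layout of the entries rather than defined in this section.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class HomologyRecord:
    # Field names are illustrative assumptions inferred from the list layout.
    clone: str
    description: Optional[str] = None
    p_value: Optional[float] = None
    length: Optional[int] = None        # length of the compared region
    unit: Optional[str] = None          # "bp" (nucleotide) or "aa" (protein)
    identity_pct: Optional[int] = None
    accession: Optional[str] = None


def parse_record(line: str) -> HomologyRecord:
    """Parse one 'clone//description//value//length//identity//accession' line."""
    clone, sep, rest = line.strip().partition("//")
    if not sep:
        # Clone listed without any homology hit.
        return HomologyRecord(clone=clone)
    # Split from the right so that the description may itself contain slashes.
    fields = rest.rsplit("//", 4)
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields after the clone name: {line!r}")
    description, p_value, length, identity, accession = (f.strip() for f in fields)
    return HomologyRecord(
        clone=clone.strip(),
        description=description.rstrip("."),
        p_value=float(p_value),
        length=int(length[:-2]),        # strip the trailing "bp" / "aa"
        unit=length[-2:],
        identity_pct=int(identity.rstrip("%")),
        accession=accession,
    )
```

For instance, under these assumptions, `parse_record("C-HEMBA1006960")` yields a record carrying only the clone name, while a full entry yields all seven fields.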

Claims 

1. Use of an oligonucleotide as a primer for synthesizing the polynucleotide comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 1-829 and 2545, or the complementary strand thereof, wherein said oligonucleotide is complementary to said polynucleotide or the complementary strand thereof and comprises at least 15 nucleotides.

2. A primer set for synthesizing polynucleotides, the primer set comprising an oligo-dT primer and an oligonucleotide complementary to the complementary strand of the polynucleotide comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 1-829 and 2545, wherein said oligonucleotide comprises at least 15 nucleotides.

3. A primer set for synthesizing polynucleotides, the primer set comprising a combination of an oligonucleotide comprising a nucleotide sequence complementary to the complementary strand of the polynucleotide comprising a 5'-end nucleotide sequence and an oligonucleotide comprising a nucleotide sequence complementary to the polynucleotide comprising a 3'-end nucleotide sequence, wherein said oligonucleotides comprise at least 15 nucleotides and wherein said combination of 5'-end nucleotide sequence/3'-end nucleotide sequence is selected from the group consisting of:



SEQ ID NO:4 and SEQ ID NO:830
SEQ ID NO:5 and SEQ ID NO:831
SEQ ID NO:6 and SEQ ID NO:832
SEQ ID NO:7 and SEQ ID NO:833
SEQ ID NO:8 and SEQ ID NO:834
SEQ ID NO:9 and SEQ ID NO:835
SEQ ID NO:11 and SEQ ID NO:836
SEQ ID NO:12 and SEQ ID NO:837
SEQ ID NO:13 and SEQ ID NO:838
SEQ ID NO:14 and SEQ ID NO:839
SEQ ID NO:15 and SEQ ID NO:840
SEQ ID NO:16 and SEQ ID NO:841
SEQ ID NO:17 and SEQ ID NO:842
SEQ ID NO:18 and SEQ ID NO:843
SEQ ID NO:20 and SEQ ID NO:844
SEQ ID NO:22 and SEQ ID NO:845
SEQ ID NO:23 and SEQ ID NO:846
SEQ ID NO:24 and SEQ ID NO:847
SEQ ID NO:25 and SEQ ID NO:848
SEQ ID NO:26 and SEQ ID NO:849
SEQ ID NO:27 and SEQ ID NO:850
SEQ ID NO:28 and SEQ ID NO:851
SEQ ID NO:29 and SEQ ID NO:852
SEQ ID NO:30 and SEQ ID NO:853
SEQ ID NO:31 and SEQ ID NO:854
SEQ ID NO:32 and SEQ ID NO:855
SEQ ID NO:33 and SEQ ID NO:856
SEQ ID NO:34 and SEQ ID NO:857
SEQ ID NO:35 and SEQ ID NO:858
SEQ ID NO:36 and SEQ ID NO:859
SEQ ID NO:37 and SEQ ID NO:860
SEQ ID NO:39 and SEQ ID NO:861
SEQ ID NO:40 and SEQ ID NO:862
SEQ ID NO:41 and SEQ ID NO:863
SEQ ID NO:42 and SEQ ID NO:864
SEQ ID NO:44 and SEQ ID NO:865
SEQ ID NO:45 and SEQ ID NO:866
SEQ ID NO:46 and SEQ ID NO:867
SEQ ID NO:47 and SEQ ID NO:868
SEQ ID NO:48 and SEQ ID NO:869
SEQ ID NO:49 and SEQ ID NO:870
SEQ ID NO:50 and SEQ ID NO:871
SEQ ID NO:52 and SEQ ID NO:872
SEQ ID NO:53 and SEQ ID NO:873
SEQ ID NO:54 and SEQ ID NO:874
SEQ ID NO:55 and SEQ ID NO:875
SEQ ID NO:56 and SEQ ID NO:876
SEQ ID NO:57 and SEQ ID NO:877
SEQ ID NO:58 and SEQ ID NO:878
SEQ ID NO:59 and SEQ ID NO:879
SEQ ID NO:60 and SEQ ID NO:880
SEQ ID NO:61 and SEQ ID NO:881
SEQ ID NO:62 and SEQ ID NO:882
SEQ ID NO:63 and SEQ ID NO:883
SEQ ID NO:64 and SEQ ID NO:884
SEQ ID NO:65 and SEQ ID NO:885
SEQ ID NO:66 and SEQ ID NO:886
SEQ ID NO:67 and SEQ ID NO:887
SEQ ID NO:69 and SEQ ID NO:888
SEQ ID NO:70 and SEQ ID NO:889
SEQ ID NO:71 and SEQ ID NO:890
SEQ ID NO:72 and SEQ ID NO:891
SEQ ID NO:73 and SEQ ID NO:892
SEQ ID NO:74 and SEQ ID NO:893
SEQ ID NO:75 and SEQ ID NO:894
SEQ ID NO:76 and SEQ ID NO:895
SEQ ID NO:77 and SEQ ID NO:896
SEQ ID NO:78 and SEQ ID NO:897
SEQ ID NO:79 and SEQ ID NO:898
SEQ ID NO:80 and SEQ ID NO:899
SEQ ID NO:81 and SEQ ID NO:900
SEQ ID NO:82 and SEQ ID NO:901
SEQ ID NO:83 and SEQ ID NO:902
SEQ ID NO:84 and SEQ ID NO:903
SEQ ID NO:85 and SEQ ID NO:904
SEQ ID NO:86 and SEQ ID NO:905
SEQ ID NO:87 and SEQ ID NO:906
SEQ ID NO:88 and SEQ ID NO:907
SEQ ID NO:89 and SEQ ID NO:908
SEQ ID NO:90 and SEQ ID NO:909
SEQ ID NO:91 and SEQ ID NO:910
SEQ ID NO:92 and SEQ ID NO:911
SEQ ID NO:93 and SEQ ID NO:912
SEQ ID NO:94 and SEQ ID NO:913
SEQ ID NO:95 and SEQ ID NO:914
SEQ ID NO:96 and SEQ ID NO:915
SEQ ID NO:97 and SEQ ID NO:916
SEQ ID NO:98 and SEQ ID NO:917
SEQ ID NO:99 and SEQ ID NO:918
SEQ ID NO:100 and SEQ ID NO:919
SEQ ID NO:101 and SEQ ID NO:920
SEQ ID NO:102 and SEQ ID NO:921
SEQ ID NO:103 and SEQ ID NO:922
SEQ ID NO:104 and SEQ ID NO:923
SEQ ID NO:105 and SEQ ID NO:924
SEQ ID NO:106 and SEQ ID NO:925
SEQ ID NO:107 and SEQ ID NO:926
SEQ ID NO:108 and SEQ ID NO:927
SEQ ID NO:109 and SEQ ID NO:928
SEQ ID NO:110 and SEQ ID NO:929
SEQ ID NO:111 and SEQ ID NO:930
SEQ ID NO:112 and SEQ ID NO:931
SEQ ID NO:113 and SEQ ID NO:932
SEQ ID NO:114 and SEQ ID NO:933
SEQ ID NO:115 and SEQ ID NO:934
SEQ ID NO:116 and SEQ ID NO:935
SEQ ID NO:117 and SEQ ID NO:936
SEQ ID NO:118 and SEQ ID NO:937
SEQ ID NO:119 and SEQ ID NO:938
SEQ ID NO:120 and SEQ ID NO:939
SEQ ID NO:122 and SEQ ID NO:940
SEQ ID NO:123 and SEQ ID NO:941
SEQ ID NO:124 and SEQ ID NO:942
SEQ ID NO:125 and SEQ ID NO:943
SEQ ID NO:126 and SEQ ID NO:944
SEQ ID NO:127 and SEQ ID NO:945
SEQ ID NO:128 and SEQ ID NO:946
SEQ ID NO:129 and SEQ ID NO:947
SEQ ID NO:130 and SEQ ID NO:948
SEQ ID NO:131 and SEQ ID NO:949
SEQ ID NO:132 and SEQ ID NO:950
SEQ ID NO:133 and SEQ ID NO:951
SEQ ID NO:134 and SEQ ID NO:952
SEQ ID NO:135 and SEQ ID NO:953
SEQ ID NO:136 and SEQ ID NO:954
SEQ ID NO:137 and SEQ ID NO:955
SEQ ID NO:138 and SEQ ID NO:956
SEQ ID NO:139 and SEQ ID NO:957
SEQ ID NO:140 and SEQ ID NO:958
SEQ ID NO:141 and SEQ ID NO:959
SEQ ID NO:142 and SEQ ID NO:960
SEQ ID NO:143 and SEQ ID NO:961
SEQ ID NO:144 and SEQ ID NO:962
SEQ ID NO:145 and SEQ ID NO:963
SEQ ID NO:146 and SEQ ID NO:964
SEQ ID NO:147 and SEQ ID NO:965
SEQ ID NO:148 and SEQ ID NO:966
SEQ ID NO:149 and SEQ ID NO:967
SEQ ID NO:150 and SEQ ID NO:968
SEQ ID NO:151 and SEQ ID NO:969
SEQ ID NO:152 and SEQ ID NO:970
SEQ ID NO:153 and SEQ ID NO:971
SEQ ID NO:154 and SEQ ID NO:972
SEQ ID NO:155 and SEQ ID NO:973
SEQ ID NO:156 and SEQ ID NO:974
SEQ ID NO:157 and SEQ ID NO:975
SEQ ID NO:158 and SEQ ID NO:976
SEQ ID NO:159 and SEQ ID NO:977
SEQ ID NO:160 and SEQ ID NO:978
SEQ ID NO:161 and SEQ ID NO:979
SEQ ID NO:162 and SEQ ID NO:980
SEQ ID NO:163 and SEQ ID NO:981
SEQ ID NO:164 and SEQ ID NO:982
SEQ ID NO:165 and SEQ ID NO:983
SEQ ID NO:166 and SEQ ID NO:984
SEQ ID NO:167 and SEQ ID NO:985
SEQ ID NO:168 and SEQ ID NO:986
SEQ ID NO:169 and SEQ ID NO:987
SEQ ID NO:170 and SEQ ID NO:988
SEQ ID NO:171 and SEQ ID NO:989
SEQ ID NO:172 and SEQ ID NO:990
SEQ ID NO:173 and SEQ ID NO:991
SEQ ID NO:174 and SEQ ID NO:992
SEQ ID NO:175 and SEQ ID NO:993
SEQ ID NO:176 and SEQ ID NO:994
SEQ ID NO:177 and SEQ ID NO:995
SEQ ID NO:178 and SEQ ID NO:996
SEQ ID NO:179 and SEQ ID NO:997
SEQ ID NO:180 and SEQ ID NO:998
SEQ ID NO:181 and SEQ ID NO:999
SEQ ID NO:182 and SEQ ID NO:1000
SEQ ID NO:183 and SEQ ID NO:1001
SEQ ID NO:184 and SEQ ID NO:1002
SEQ ID NO:185 and SEQ ID NO:1003
SEQ ID NO:186 and SEQ ID NO:1004
SEQ ID NO:187 and SEQ ID NO:1005
SEQ ID NO:188 and SEQ ID NO:1006
SEQ ID NO:189 and SEQ ID NO:1007
SEQ ID NO:190 and SEQ ID NO:1008
SEQ ID NO:191 and SEQ ID NO:1009
SEQ ID NO:193 and SEQ ID NO:1010
SEQ ID NO:194 and SEQ ID NO:1011
SEQ ID NO:195 and SEQ ID NO:1012
SEQ ID NO:196 and SEQ ID NO:1013
SEQ ID NO:197 and SEQ ID NO:1014
SEQ ID NO:198 and SEQ ID NO:1015
SEQ ID NO:199 and SEQ ID NO:1016
SEQ ID NO:200 and SEQ ID NO:1017
SEQ ID NO:201 and SEQ ID NO:1018
SEQ ID NO:202 and SEQ ID NO:1019
SEQ ID NO:203 and SEQ ID NO:1020
SEQ ID NO:204 and SEQ ID NO:1021
SEQ ID NO:205 and SEQ ID NO:1022
SEQ ID NO:207 and SEQ ID NO:1023
SEQ ID NO:208 and SEQ ID NO:1024
SEQ ID NO:209 and SEQ ID NO:1025
SEQ ID NO:210 and SEQ ID NO:1026
SEQ ID NO:211 and SEQ ID NO:1027
SEQ ID NO:212 and SEQ ID NO:1028
SEQ ID NO:213 and SEQ ID NO:1029
SEQ ID NO:214 and SEQ ID NO:1030
SEQ ID NO:215 and SEQ ID NO:1031
SEQ ID NO:216 and SEQ ID NO:1032
SEQ ID NO:248 and SEQ ID NO:1033
SEQ ID NO:249 and SEQ ID NO:1034
SEQ ID NO:250 and SEQ ID NO:1035
SEQ ID NO:251 and SEQ ID NO:1036
SEQ ID NO:252 and SEQ ID NO:1037
SEQ ID NO:253 and SEQ ID NO:1038
SEQ ID NO:254 and SEQ ID NO:1039
SEQ ID NO:255 and SEQ ID NO:1040
SEQ ID NO:256 and SEQ ID NO:1041
SEQ ID NO:257 and SEQ ID NO:1042
SEQ ID NO:258 and SEQ ID NO:1043
SEQ ID NO:259 and SEQ ID NO:1044
SEQ ID NO:260 and SEQ ID NO:1045
SEQ ID NO:261 and SEQ ID NO:1046
SEQ ID NO:262 and SEQ ID NO:1047
SEQ ID NO:263 and SEQ ID NO:1048
SEQ ID NO:264 and SEQ ID NO:1049
SEQ ID NO:265 and SEQ ID NO:1050
SEQ ID NO:266 and SEQ ID NO:1051
SEQ ID NO:267 and SEQ ID NO:1052
SEQ ID NO:268 and SEQ ID NO:1053
SEQ ID NO:291 and SEQ ID NO:1054
SEQ ID NO:292 and SEQ ID NO:1055
SEQ ID NO:293 and SEQ ID NO:1056
SEQ ID NO:294 and SEQ ID NO:1057
SEQ ID NO:295 and SEQ ID NO:1058
SEQ ID NO:296 and SEQ ID NO:1059
SEQ ID NO:297 and SEQ ID NO:1060
SEQ ID NO:298 and SEQ ID NO:1061
SEQ ID NO:300 and SEQ ID NO:1062
SEQ ID NO:301 and SEQ ID NO:1063
SEQ ID NO:302 and SEQ ID NO:1064
SEQ ID NO:304 and SEQ ID NO:1065
SEQ ID NO:305 and SEQ ID NO:1066
SEQ ID NO:306 and SEQ ID NO:1067
SEQ ID NO:307 and SEQ ID NO:1068
SEQ ID NO:308 and SEQ ID NO:1069
SEQ ID NO:309 and SEQ ID NO:1070
SEQ ID NO:310 and SEQ ID NO:1071
SEQ ID NO:311 and SEQ ID NO:1072
SEQ ID NO:312 and SEQ ID NO:1073
SEQ ID NO:313 and SEQ ID NO:1074
SEQ ID NO:314 and SEQ ID NO:1075
SEQ ID NO:315 and SEQ ID NO:1076
SEQ ID NO:317 and SEQ ID NO:1077
SEQ ID NO:318 and SEQ ID NO:1078
SEQ ID NO:319 and SEQ ID NO:1079
SEQ ID NO:320 and SEQ ID NO:1080
SEQ ID NO:321 and SEQ ID NO:1081
SEQ ID NO:322 and SEQ ID NO:1082
SEQ ID NO:323 and SEQ ID NO:1083
SEQ ID NO:324 and SEQ ID NO:1084
SEQ ID NO:325 and SEQ ID NO:1085
SEQ ID NO:326 and SEQ ID NO:1086
SEQ ID NO:327 and SEQ ID NO:1087
SEQ ID NO:328 and SEQ ID NO:1088
SEQ ID NO:329 and SEQ ID NO:1089
SEQ ID NO:330 and SEQ ID NO:1090
SEQ ID NO:331 and SEQ ID NO:1091
SEQ ID NO:332 and SEQ ID NO:1092
SEQ ID NO:333 and SEQ ID NO:1093
SEQ ID NO:334 and SEQ ID NO:1094
SEQ ID NO:335 and SEQ ID NO:1095
SEQ ID NO:336 and SEQ ID NO:1096
SEQ ID NO:337 and SEQ ID NO:1097
SEQ ID NO:338 and SEQ ID NO:1098
SEQ ID NO:339 and SEQ ID NO:1099
SEQ ID NO:340 and SEQ ID NO:1100
SEQ ID NO:341 and SEQ ID NO:1101
SEQ ID NO:342 and SEQ ID NO:1102
SEQ ID NO:343 and SEQ ID NO:1103
SEQ ID NO:344 and SEQ ID NO:1104
SEQ ID NO:345 and SEQ ID NO:1105
SEQ ID NO:347 and SEQ ID NO:1106
SEQ ID NO:348 and SEQ ID NO:1107
SEQ ID NO:349 and SEQ ID NO:1108
SEQ ID NO:350 and SEQ ID NO:1109
SEQ ID NO:351 and SEQ ID NO:1110
SEQ ID NO:352 and SEQ ID NO:1111
SEQ ID NO:353 and SEQ ID NO:1112
SEQ ID NO:354 and SEQ ID NO:1113
SEQ ID NO:355 and SEQ ID NO:1114
SEQ ID NO:356 and SEQ ID NO:1115
SEQ ID NO:357 and SEQ ID NO:1116
SEQ ID NO:358 and SEQ ID NO:1117
SEQ ID NO:359 and SEQ ID NO:1118
SEQ ID NO:360 and SEQ ID NO:1119
SEQ ID NO:361 and SEQ ID NO:1120
SEQ ID NO:362 and SEQ ID NO:1121
SEQ ID NO:363 and SEQ ID NO:1122
SEQ ID NO:364 and SEQ ID NO:1123
SEQ ID NO:365 and SEQ ID NO:1124
SEQ ID NO:366 and SEQ ID NO:1125
SEQ ID NO:367 and SEQ ID NO:1126
SEQ ID NO:368 and SEQ ID NO:1127
SEQ ID NO:369 and SEQ ID NO:1128
SEQ ID NO:370 and SEQ ID NO:1129
SEQ ID NO:371 and SEQ ID NO:1130
SEQ ID NO:372 and SEQ ID NO:1131
SEQ ID NO:373 and SEQ ID NO:1132
SEQ ID NO:374 and SEQ ID NO:1133
SEQ ID NO:375 and SEQ ID NO:1134
SEQ ID NO:376 and SEQ ID NO:1135
SEQ ID NO:377 and SEQ ID NO:1136
SEQ ID NO:378 and SEQ ID NO:1137
SEQ ID NO:379 and SEQ ID NO:1138
SEQ ID NO:380 and SEQ ID NO:1139
SEQ ID NO:381 and SEQ ID NO:1140
SEQ ID NO:382 and SEQ ID NO:1141
SEQ ID NO:383 and SEQ ID NO:1142
SEQ ID NO:384 and SEQ ID NO:1143
SEQ ID NO:385 and SEQ ID NO:1144
SEQ ID NO:386 and SEQ ID NO:1145
SEQ ID NO:387 and SEQ ID NO:1146
SEQ ID NO:388 and SEQ ID NO:1147
SEQ ID NO:389 and SEQ ID NO:1148
SEQ ID NO:390 and SEQ ID NO:1149
SEQ ID NO:391 and SEQ ID NO:1150
SEQ ID NO:392 and SEQ ID NO:1151
SEQ ID NO:393 and SEQ ID NO:1152
SEQ ID NO:394 and SEQ ID NO:1153
SEQ ID NO:395 and SEQ ID NO:1154
SEQ ID NO:396 and SEQ ID NO:1155
SEQ ID NO:397 and SEQ ID NO:1156
SEQ ID NO:398 and SEQ ID NO:1157
SEQ ID NO:399 and SEQ ID NO:1158
SEQ ID NO:400 and SEQ ID NO:1159
SEQ ID NO:401 and SEQ ID NO:1160
SEQ ID NO:402 and SEQ ID NO:1161
SEQ ID NO:403 and SEQ ID NO:1162
SEQ ID NO:404 and SEQ ID NO:1163
SEQ ID NO:405 and SEQ ID NO:1164
SEQ ID NO:406 and SEQ ID NO:1165
SEQ ID NO:407 and SEQ ID NO:1166
SEQ ID NO:408 and SEQ ID NO:1167
SEQ ID NO:409 and SEQ ID NO:1168
SEQ ID NO:410 and SEQ ID NO:1169
SEQ ID NO:411 and SEQ ID NO:1170
SEQ ID NO:412 and SEQ ID NO:1171
SEQ ID NO:413 and SEQ ID NO:1172
SEQ ID NO:414 and SEQ ID NO:1173
SEQ ID NO:415 and SEQ ID NO:1174
SEQ ID NO:416 and SEQ ID NO:1175
SEQ ID NO:417 and SEQ ID NO:1176
SEQ ID NO:419 and SEQ ID NO:1177
SEQ ID NO:420 and SEQ ID NO:1178
SEQ ID NO:421 and SEQ ID NO:1179
SEQ ID NO:422 and SEQ ID NO:1180
SEQ ID NO:423 and SEQ ID NO:1181
SEQ ID NO:424 and SEQ ID NO:1182
SEQ ID NO:425 and SEQ ID NO:1183
SEQ ID NO:426 and SEQ ID NO:1184
SEQ ID NO:427 and SEQ ID NO:1185
SEQ ID NO:428 and SEQ ID NO:1186
SEQ ID NO:429 and SEQ ID NO:1187
SEQ ID NO:430 and SEQ ID NO:1188
SEQ ID NO:431 and SEQ ID NO:1189
SEQ ID NO:432 and SEQ ID NO:1190
SEQ ID NO:433 and SEQ ID NO:1191
SEQ ID NO:434 and SEQ ID NO:1192
SEQ ID NO:435 and SEQ ID NO:1193
SEQ ID NO:436 and SEQ ID NO:1194
SEQ ID NO:437 and SEQ ID NO:1195
SEQ ID NO:438 and SEQ ID NO:1196
SEQ ID NO:439 and SEQ ID NO:1197
SEQ ID NO:440 and SEQ ID NO:1198
SEQ ID NO:441 and SEQ ID NO:1199
SEQ ID NO:442 and SEQ ID NO:1200
SEQ ID NO:443 and SEQ ID NO:1201
SEQ ID NO:444 and SEQ ID NO:1202
SEQ ID NO:446 and SEQ ID NO:1203
SEQ ID NO:447 and SEQ ID NO:1204
SEQ ID NO:448 and SEQ ID NO:1205
SEQ ID NO:450 and SEQ ID NO:1206
SEQ ID NO:451 and SEQ ID NO:1207
SEQ ID NO:452 and SEQ ID NO:1208
SEQ ID NO:453 and SEQ ID NO:1209
SEQ ID NO:454 and SEQ ID NO:1210
SEQ ID NO:455 and SEQ ID NO:1211
SEQ ID NO:456 and SEQ ID NO:1212
SEQ ID NO:457 and SEQ ID NO:1213
SEQ ID NO:458 and SEQ ID NO:1214
SEQ ID NO:459 and SEQ ID NO:1215
SEQ ID NO:460 and SEQ ID NO:1216
SEQ ID NO:461 and SEQ ID NO:1217
SEQ ID NO:462 and SEQ ID NO:1218
SEQ ID NO:463 and SEQ ID NO:1219
SEQ ID NO:464 and SEQ ID NO:1220
SEQ ID NO:465 and SEQ ID NO:1221
SEQ ID NO:466 and SEQ ID NO:1222
SEQ ID NO:467 and SEQ ID NO:1223
SEQ ID NO:468 and SEQ ID NO:1224
SEQ ID NO:469 and SEQ ID NO:1225
SEQ ID NO:470 and SEQ ID NO:1226
SEQ ID NO:471 and SEQ ID NO:1227
SEQ ID NO:472 and SEQ ID NO:1228
SEQ ID NO:473 and SEQ ID NO:1229
SEQ ID NO:474 and SEQ ID NO:1230
SEQ ID NO:475 and SEQ ID NO:1231
SEQ ID NO:476 and SEQ ID NO:1232
SEQ ID NO:477 and SEQ ID NO:1233
SEQ ID NO:478 and SEQ ID NO:1234
SEQ ID NO:479 and SEQ ID NO:1235
SEQ ID NO:480 and SEQ ID NO:1236
SEQ ID NO:481 and SEQ ID NO:1237
SEQ ID NO:482 and SEQ ID NO:1238
SEQ ID NO:483 and SEQ ID NO:1239
SEQ ID NO:484 and SEQ ID NO:1240
SEQ ID NO:485 and SEQ ID NO:1241
SEQ ID NO:486 and SEQ ID NO:1242
SEQ ID NO:487 and SEQ ID NO:1243
SEQ ID NO:488 and SEQ ID NO:1244
SEQ ID NO:489 and SEQ ID NO:1245
SEQ ID NO:490 and SEQ ID NO:1246
SEQ ID NO:492 and SEQ ID NO:1247
SEQ ID NO:493 and SEQ ID NO:1248
SEQ ID NO:494 and SEQ ID NO:1249
SEQ ID NO:496 and SEQ ID NO:1250
SEQ ID NO:497 and SEQ ID NO:1251
SEQ ID NO:498 and SEQ ID NO:1252
SEQ ID NO:499 and SEQ ID NO:1253
SEQ ID NO:500 and SEQ ID NO:1254
SEQ ID NO:501 and SEQ ID NO:1255
SEQ ID NO:502 and SEQ ID NO:1256
SEQ ID NO:503 and SEQ ID NO:1257
SEQ ID NO:504 and SEQ ID NO:1258
SEQ ID NO:505 and SEQ ID NO:1259
SEQ ID NO:506 and SEQ ID NO:1260
SEQ ID NO:507 and SEQ ID NO:1261
SEQ ID NO:508 and SEQ ID NO:1262
SEQ ID NO:509 and SEQ ID NO:1263
SEQ ID NO:510 and SEQ ID NO:1264
SEQ ID NO:511 and SEQ ID NO:1265
SEQ ID NO:513 and SEQ ID NO:1266
SEQ ID NO:514 and SEQ ID NO:1267
SEQ ID NO:515 and SEQ ID NO:1268
SEQ ID NO:516 and SEQ ID NO:1269
SEQ ID NO:517 and SEQ ID NO:1270
SEQ ID NO:518 and SEQ ID NO:1271
SEQ ID NO:519 and SEQ ID NO:1272
SEQ ID NO:520 and SEQ ID NO:1273
SEQ ID NO:521 and SEQ ID NO:1274
SEQ ID NO:522 and SEQ ID NO:1275
SEQ ID NO:523 and SEQ ID NO:1276
SEQ ID NO:524 and SEQ ID NO:1277
SEQ ID NO:525 and SEQ ID NO:1278
SEQ ID NO:526 and SEQ ID NO:1279
SEQ ID NO:527 and SEQ ID NO:1280
SEQ ID NO:528 and SEQ ID NO:1281
SEQ ID NO:529 and SEQ ID NO:1282
SEQ ID NO:530 and SEQ ID NO:1283
SEQ ID NO:531 and SEQ ID NO:1284
SEQ ID NO:532 and SEQ ID NO:1285
SEQ ID NO:533 and SEQ ID NO:1286
SEQ ID NO:534 and SEQ ID NO:1287
SEQ ID NO:535 and SEQ ID NO:1288
SEQ ID NO:536 and SEQ ID NO:1289
SEQ ID NO:537 and SEQ ID NO:1290
SEQ ID NO:538 and SEQ ID NO:1291
SEQ ID NO:539 and SEQ ID NO:1292
SEQ ID NO:540 and SEQ ID NO:1293
SEQ ID NO:541 and SEQ ID NO:1294
SEQ ID NO:542 and SEQ ID NO:1295
SEQ ID NO:543 and SEQ ID NO:1296
SEQ ID NO:544 and SEQ ID NO:1297
SEQ ID NO:545 and SEQ ID NO:1298
SEQ ID NO:546 and SEQ ID NO:1299
SEQ ID NO:547 and SEQ ID NO:1300
SEQ ID NO:548 and SEQ ID NO:1301
SEQ ID NO:549 and SEQ ID NO:1302
SEQ ID NO:550 and SEQ ID NO:1303
SEQ ID NO:551 and SEQ ID NO:1304
SEQ ID NO:552 and SEQ ID NO:1305
SEQ ID NO:553 and SEQ ID NO:1306
SEQ ID NO:554 and SEQ ID NO:1307
SEQ ID NO:555 and SEQ ID NO:1308
SEQ ID NO:556 and SEQ ID NO:1309
SEQ ID NO:557 and SEQ ID NO:1310
SEQ ID NO:558 and SEQ ID NO:1311
SEQ ID NO:559 and SEQ ID NO:1312
SEQ ID NO:560 and SEQ ID NO:1313
SEQ ID NO:561 and SEQ ID NO:1314
SEQ ID NO:562 and SEQ ID NO:1315
SEQ ID NO:563 and SEQ ID NO:1316
SEQ ID NO:564 and SEQ ID NO:1317
SEQ ID NO:565 and SEQ ID NO:1318
SEQ ID NO:566 and SEQ ID NO:1319
SEQ ID NO:567 and SEQ ID NO:1320
SEQ ID NO:569 and SEQ ID NO:1321
SEQ ID NO:570 and SEQ ID NO:1322
SEQ ID NO:571 and SEQ ID NO:1323
SEQ ID NO:572 and SEQ ID NO:1324
SEQ ID NO:573 and SEQ ID NO:1325
SEQ ID NO:574 and SEQ ID NO:1326
SEQ ID NO:575 and SEQ ID NO:1327
SEQ ID NO:576 and SEQ ID NO:1328
SEQ ID NO:577 and SEQ ID NO:1329
SEQ ID NO:579 and SEQ ID NO:1330
SEQ ID NO:580 and SEQ ID NO:1331
SEQ ID NO:581 and SEQ ID NO:1332
SEQ ID NO:582 and SEQ ID NO:1333
SEQ ID NO:583 and SEQ ID NO:1334
SEQ ID NO:584 and SEQ ID NO:1335
SEQ ID NO:585 and SEQ ID NO:1336
SEQ ID NO:586 and SEQ ID NO:1337
SEQ ID NO:587 and SEQ ID NO:1338
SEQ ID NO:588 and SEQ ID NO:1339
SEQ ID NO:589 and SEQ ID NO:1340
SEQ ID NO:590 and SEQ ID NO:1341
SEQ ID NO:591 and SEQ ID NO:1342
SEQ ID NO:592 and SEQ ID NO:1343
SEQ ID NO:593 and SEQ ID NO:1344
SEQ ID NO:594 and SEQ ID NO:1345
SEQ ID NO:595 and SEQ ID NO:1346
SEQ ID NO:596 and SEQ ID NO:1347
SEQ ID NO:597 and SEQ ID NO:1348
SEQ ID NO:598 and SEQ ID NO:1349
SEQ ID NO:599 and SEQ ID NO:1350
SEQ ID NO:600 and SEQ ID NO:1351
SEQ ID NO:601 and SEQ ID NO:1352
SEQ ID NO:602 and SEQ ID NO:1353
SEQ ID NO:603 and SEQ ID NO:1354
SEQ ID NO:604 and SEQ ID NO:1355
SEQ ID NO:605 and SEQ ID NO:1356
SEQ ID NO:606 and SEQ ID NO:1357
SEQ ID NO:607 and SEQ ID NO:1358
SEQ ID NO:608 and SEQ ID NO:1359
SEQ ID NO:609 and SEQ ID NO:1360
SEQ ID NO:610 and SEQ ID NO:1361
SEQ ID NO:611 and SEQ ID NO:1362
SEQ ID NO:612 and SEQ ID NO:1363
SEQ ID NO:613 and SEQ ID NO:1364
SEQ ID NO:614 and SEQ ID NO:1365
SEQ ID NO:615 and SEQ ID NO:1366
SEQ ID NO:616 and SEQ ID NO:1367
SEQ ID NO:617 and SEQ ID NO:1368
SEQ ID NO:618 and SEQ ID NO:1369
SEQ ID NO:619 and SEQ ID NO:1370
SEQ ID NO:620 and SEQ ID NO:1371
SEQ ID NO:621 and SEQ ID NO:1372
SEQ ID NO:622 and SEQ ID NO:1373
SEQ ID NO:623 and SEQ ID NO:1374
SEQ ID NO:624 and SEQ ID NO:1375
SEQ ID NO:625 and SEQ ID NO:1376
SEQ ID NO:626 and SEQ ID NO:1377
SEQ ID NO:627 and SEQ ID NO:1378
SEQ ID NO:628 and SEQ ID NO:1379
SEQ ID NO:629 and SEQ ID NO:1380
SEQ ID NO:630 and SEQ ID NO:1381
SEQ ID NO:631 and SEQ ID NO:1382
SEQ ID NO:632 and SEQ ID NO:1383
SEQ ID NO:633 and SEQ ID NO:1384
SEQ ID NO:634 and SEQ ID NO:1385
SEQ ID NO:635 and SEQ ID NO:1386
SEQ ID NO:636 and SEQ ID NO:1387
SEQ ID NO:637 and SEQ ID NO:1388
SEQ ID NO:638 and SEQ ID NO:1389
SEQ ID NO:639 and SEQ ID NO:1390
SEQ ID NO:640 and SEQ ID NO:1391
SEQ ID NO:641 and SEQ ID NO:1392
SEQ ID NO:642 and SEQ ID NO:1393
SEQ ID NO:643 and SEQ ID NO:1394
SEQ ID NO:644 and SEQ ID NO:1395
SEQ ID NO:645 and SEQ ID NO:1396
SEQ ID NO:646 and SEQ ID NO:1397
SEQ ID NO:647 and SEQ ID NO:1398
SEQ ID NO:648 and SEQ ID NO:1399
SEQ ID NO:649 and SEQ ID NO:1400
SEQ ID NO:650 and SEQ ID NO:1401
SEQ ID NO:651 and SEQ ID NO:1402
SEQ ID NO:652 and SEQ ID NO:1403
SEQ ID NO:653 and SEQ ID NO:1404
SEQ ID NO:654 and SEQ ID NO:1405
SEQ ID NO:655 and SEQ ID NO:1406
SEQ ID NO:656 and SEQ ID NO:1407
SEQ ID NO:657 and SEQ ID NO:1408
SEQ ID NO:658 and SEQ ID NO:1409
SEQ ID NO:659 and SEQ ID NO:1410
SEQ ID NO:660 and SEQ ID NO:1411
SEQ ID NO:661 and SEQ ID NO:1412
SEQ ID NO:662 and SEQ ID NO:1413
SEQ ID NO:663 and SEQ ID NO:1414
SEQ ID NO:664 and SEQ ID NO:1415
SEQ ID NO:665 and SEQ ID NO:1416
SEQ ID NO:666 and SEQ ID NO:1417
SEQ ID NO:667 and SEQ ID NO:1418
SEQ ID NO:668 and SEQ ID NO:1419
SEQ ID NO:669 and SEQ ID NO:1420
SEQ ID NO:670 and SEQ ID NO:1421
SEQ ID NO:671 and SEQ ID NO:1422
SEQ ID NO:672 and SEQ ID NO:1423
SEQ ID NO:673 and SEQ ID NO:1424
SEQ ID NO:674 and SEQ ID NO:1425
SEQ ID NO:675 and SEQ ID NO:1426
SEQ ID NO:676 and SEQ ID NO:1427
SEQ ID NO:677 and SEQ ID NO:1428
SEQ ID NO:678 and SEQ ID NO:1429
SEQ ID NO:679 and SEQ ID NO:1430
SEQ ID NO:680 and SEQ ID NO:1431
SEQ ID NO:681 and SEQ ID NO:1432
SEQ ID NO:682 and SEQ ID NO:1433
SEQ ID NO:683 and SEQ ID NO:1434
SEQ ID NO:684 and SEQ ID NO:1435
SEQ ID NO:685 and SEQ ID NO:1436
SEQ ID NO:686 and SEQ ID NO:1437
SEQ ID NO:687 and SEQ ID NO:1438
SEQ ID NO:688 and SEQ ID NO:1439
SEQ ID NO:689 and SEQ ID NO:1440
SEQ ID NO:690 and SEQ ID NO:1441
SEQ ID NO:691 and SEQ ID NO:1442
SEQ ID NO:692 and SEQ ID NO:1443
SEQ ID NO:693 and SEQ ID NO:1444
SEQ ID NO:694 and SEQ ID NO:1445
SEQ ID NO:695 and SEQ ID NO:1446
SEQ ID NO:696 and SEQ ID NO:1447
SEQ ID NO:697 and SEQ ID NO:1448
SEQ ID NO:698 and SEQ ID NO:1449
SEQ ID NO:699 and SEQ ID NO:1450
SEQ ID NO:700 and SEQ ID NO:1451
SEQ ID NO:701 and SEQ ID NO:1452
SEQ ID NO:702 and SEQ ID NO:1453
SEQ ID NO:703 and SEQ ID NO:1454
SEQ ID NO:704 and SEQ ID NO:1455
SEQ ID NO:705 and SEQ ID NO:1456
SEQ ID NO:706 and SEQ ID NO:1457
SEQ ID NO:707 and SEQ ID NO:1458
SEQ ID NO:708 and SEQ ID NO:1459
SEQ ID NO:709 and SEQ ID NO:1460
SEQ ID NO:710 and SEQ ID NO:1461
SEQ ID NO:711 and SEQ ID NO:1462
SEQ ID NO:712 and SEQ ID NO:1463
SEQ ID NO:713 and SEQ ID NO:1464
SEQ ID NO:714 and SEQ ID NO:1465
SEQ ID NO:715 and SEQ ID NO:1466
SEQ ID NO:716 and SEQ ID NO:1467
SEQ ID NO:717 and SEQ ID NO:1468
SEQ ID NO:718 and SEQ ID NO:1469
SEQ ID NO:719 and SEQ ID NO:1470
SEQ ID NO:720 and SEQ ID NO:1471
SEQ ID NO:721 and SEQ ID NO:1472
SEQ ID NO:722 and SEQ ID NO:1473
SEQ ID NO:723 and SEQ ID NO:1474
SEQ ID NO:724 and SEQ ID NO:1475
SEQ ID NO:725 and SEQ ID NO:1476
SEQ ID NO:726 and SEQ ID NO:1477
SEQ ID NO:727 and SEQ ID NO:1478
SEQ ID NO:729 and SEQ ID NO:1479
SEQ ID NO:730 and SEQ ID NO:1480
SEQ ID NO:731 and SEQ ID NO:1481
SEQ ID NO:732 and SEQ ID NO:1482
SEQ ID NO:733 and SEQ ID NO:1483
SEQ ID NO:734 and SEQ ID NO:1484
SEQ ID NO:735 and SEQ ID NO:1485
SEQ ID NO:736 and SEQ ID NO:1486
SEQ ID NO:737 and SEQ ID NO:1487
SEQ ID NO:738 and SEQ ID NO:1488
SEQ ID NO:739 and SEQ ID NO:1489
SEQ ID NO:740 and SEQ ID NO:1490
SEQ ID NO:741 and SEQ ID NO:1491
SEQ ID NO:745 and SEQ ID NO:1492
SEQ ID NO:746 and SEQ ID NO:1493
SEQ ID NO:747 and SEQ ID NO: 1494 
SEQ ID NO:748 and SEQ ID NO: 1495 
SEQ ID NO:749 and SEQ ID NO: 1496 
SEQ ID NO:750 and SEQ ID NO: 1497 
SEQ ID NO:751 and SEQ ID NO: 1498 
SEQ ID NO:752 and SEQ ID NO: 1499 
SEQ ID NO:753 and SEQ ID NO: 1500 
SEQ ID NO:754 and SEQ ID NO: 1501 
SEQ ID NO:755 and SEQ ID NO: 1502 
SEQ ID NO:756 and SEQ ID NO: 1503 
SEQ ID NO:757 and SEQ ID NO: 1504 
SEQ ID NO:758 and SEQ ID NO: 1505 
SEQ ID NO:759 and SEQ ID NO: 1506 
SEQ ID NO:760 and SEQ ID NO: 1507 
SEQ ID NO:761 and SEQ ID NO: 1508 
SEQ ID NO:762 and SEQ ID NO: 1509 
SEQ ID NO:763 and SEQ ID NO: 1510 
SEQ ID NO:764 and SEQ ID NO: 151 1 
SEQ ED NO:765 and SEQ ID NO: 15 12 
SEQ ID NO:766 and SEQ ID NO: 1513 
SEQ ID NO:767 and SEQ ID NO: 1514 
SEQ ID NO:768 and SEQ ED N0:1515 
SEQ ID NO:769 and SEQ ID NO:1516 
SEQ ID NO:770 and SEQ ID NO:1517 
SEQ ID NO:771 and SEQ ID NO:1518 
SEQ ID NO:772 and SEQ ID NO: 1519 
SEQ ID NO:773 and SEQ ID NO: 1520 
SEQ ID NO:774 and SEQ ID NO: 1521 
SEQ ID NO:775 and SEQ ID NO: 1522 
SEQ ID NO:776 and SEQ ED NO: 1523 
SEQ ID NO:777 and SEQ ID NO: 1524 
SEQ ID NO:778 and SEQ ED NO: 1525 
SEQ ID NO:779 and SEQ ED NO: 1526 
SEQ ID NO:780 and SEQ ID NO: 1527 
SEQ ED NO:781 and SEQ ID NO:1528 
SEQ ID NO:782 and SEQ ID NO: 1529 
SEQ ID NO:783 and SEQ ID NO: 1530 
SEQ ID NO:784 and SEQ ID NO: 1531
SEQ ID NO:785 and SEQ ID NO: 1532
SEQ ID NO:786 and SEQ ID NO: 1533
SEQ ID NO:787 and SEQ ID NO: 1534
SEQ ID NO:788 and SEQ ID NO: 1535
SEQ ID NO:789 and SEQ ID NO: 1536
SEQ ID NO:790 and SEQ ID NO: 1537
SEQ ID NO:791 and SEQ ID NO:1538
SEQ ID NO:792 and SEQ ID NO: 1539
SEQ ID NO:793 and SEQ ID NO: 1540
SEQ ID NO:794 and SEQ ID NO: 1541 
SEQ ID NO:795 and SEQ ID NO: 1542 
SEQ ID NO:797 and SEQ ID NO: 1543 
SEQ ID NO:798 and SEQ ID NO: 1544 
SEQ ID NO:799 and SEQ ID NO: 1545
SEQ ID NO:800 and SEQ ID NO: 1546 
SEQ ID NO:801 and SEQ ID NO: 1547 
SEQ ID NO:802 and SEQ ID NO: 1548 
SEQ ID NO:803 and SEQ ID NO: 1549 
SEQ ID NO:804 and SEQ ID NO: 1550 
SEQ ID NO:805 and SEQ ID NO: 1551 
SEQ ID NO:806 and SEQ ID NO: 1552 
SEQ ID NO:807 and SEQ ID NO: 1553
SEQ ID NO:809 and SEQ ID NO: 1554 
SEQ ID NO:810 and SEQ ID NO: 1555 
SEQ ID NO:811 and SEQ ID NO: 1556
SEQ ID NO:812 and SEQ ID NO:1557
SEQ ID NO:813 and SEQ ID NO: 1558
SEQ ID NO:814 and SEQ ID NO: 1559
SEQ ID NO:816 and SEQ ID NO: 1560
SEQ ID NO:817 and SEQ ID NO: 1561 
SEQ ID NO:818 and SEQ ID NO:1562 
SEQ ID NO:819 and SEQ ID NO:1563 
SEQ ID NO:820 and SEQ ID NO: 1564 
SEQ ID NO:822 and SEQ ID NO: 1565 
SEQ ID NO:823 and SEQ ID NO: 1566 
SEQ ID NO:824 and SEQ ID NO: 1567
SEQ ID NO:825 and SEQ ID NO: 1568
SEQ ID NO:826 and SEQ ID NO: 1569
SEQ ID NO:827 and SEQ ID NO: 1570 
SEQ ID NO:828 and SEQ ID NO: 1571 
SEQ ID NO:829 and SEQ ID NO: 1572, and 
SEQ ID NO:2545 and SEQ ID NO:2546.

4. A polynucleotide which can be synthesized with the primer set of claim 2 or 3. 

5. A polynucleotide comprising a coding region in the polynucleotide of claim 4. 

6. A substantially pure protein encoded by the polynucleotide of claim 4.

7. A partial peptide of the protein of claim 6. 

8. An isolated polynucleotide selected from the group consisting of:

(a) a polynucleotide comprising a coding region of the nucleotide sequence set forth in any one of the following 
SEQ ID NOs: 

SEQ ID NO:2547, SEQ ID NO:nnnn , SEQ ID NO:2551, SEQ ID NO:2553, SEQ ID NO:2555,
SEQ ID NO:2557, SEQ ID NO:2558, SEQ ID NO:2560, SEQ ID NO:2562, SEQ ID NO:2564,
SEQ ID NO:2566, SEQ ID NO:2568, SEQ ID NO:2570, SEQ ID NO:2572, SEQ ID NO:2574,
SEQ ID NO:2576, SEQ ID NO:2578, SEQ ID NO:2580, SEQ ID NO:2581, SEQ ID NO:2583,
SEQ ID NO:2584, SEQ ID NO:nnnn , SEQ ID NO:2588, SEQ ID NO:2590, SEQ ID NO:2592, 
SEQ ID NO:2594, SEQ ID NO:2596, SEQ ID NO:2598, SEQ ID NO:nnnn , SEQ ID NO:2602, 
SEQ ID NO:2604, SEQ ID NO:2606, SEQ ID NO:2607, SEQ ID NO:2609, SEQ ID NO:2611,
SEQ ID NO:2613, SEQ ID NO:2615, SEQ ID NO:2617, SEQ ID NO:2619, SEQ ID NO:2621,
SEQ ID NO:2623, SEQ ID NO:2625, SEQ ID NO:2627, SEQ ID NO:2629, SEQ ID NO:2630,
SEQ ID NO:2632, SEQ ID NO:2634, SEQ ID NO:2636, SEQ ID NO:2638, SEQ ID NO:2640,
SEQ ID NO:2642, SEQ ID NO:2644, SEQ ID NO:2646, SEQ ID NO:2648, SEQ ID NO:2650,
SEQ ID NO:2652, SEQ ID NO:2654, SEQ ID NO:2656, SEQ ID NO:2658, SEQ ID NO:2660, 
SEQ ID NO:2662, SEQ ID NO:2664, SEQ ID NO:2666, SEQ ID NO:2668, SEQ ID NO:2670, 
SEQ ID NO:2672, SEQ ID NO:2674, SEQ ID NO:2676, SEQ ID NO:2678, SEQ ID NO:2680, 
SEQ ID NO:2681, SEQ ID NO:2683, SEQ ID NO:2685, SEQ ID NO:2686, SEQ ID NO:2688, 
SEQ ID NO:2690, SEQ ID NO:2692, SEQ ID NO:2694, SEQ ID NO:2696, SEQ ID NO:2698,
SEQ ID NO:2700, SEQ ID NO:2702, SEQ ID NO:2704, SEQ ID NO:2706, SEQ ID NO:2708, 
SEQ ID NO:2710, SEQ ID NO:2712, SEQ ID NO:2714, SEQ ID NO:2715, SEQ ID NO:2717,
SEQ ID NO:2718, SEQ ID NO:2719, SEQ ID NO:2721, SEQ ID NO:2722, SEQ ID NO:2724,
SEQ ID NO:2726, SEQ ID NO:2728, SEQ ID NO:2730, SEQ ID NO:2732, SEQ ID NO:2734,
SEQ ID NO:2735, SEQ ID NO:2737, SEQ ID NO:2739, SEQ ID NO:nnnn , SEQ ID NO:2743,
SEQ ID NO:2745, SEQ ID NO:2747, SEQ ID NO:2749, SEQ ID NO:2750, SEQ ID NO:2752,
SEQ ID NO:2754, SEQ ID NO:2756, SEQ ID NO:2758, SEQ ID NO:2760, SEQ ID NO:2761,
SEQ ID NO:nnnn , SEQ ID NO:2765, SEQ ID NO:2766, SEQ ID NO:2768, SEQ ID NO:2770,
SEQ ID NO:2772, SEQ ID NO:2774, SEQ ID NO:2776, SEQ ID NO:2778, SEQ ID NO:2780,
SEQ ID NO:2782, SEQ ID NO:2784, SEQ ID NO:2786, SEQ ID NO:2788, SEQ ID NO:2790, 
SEQ ID NO:2792, SEQ ID NO:2794, SEQ ID NO:2796, SEQ ID NO:2798, SEQ ID NO:2800, 
SEQ ID NO:2802, SEQ ID NO:2804, SEQ ID NO:2806, SEQ ID NO:2808, SEQ ID NO:2810,
SEQ ID NO:2812, SEQ ID NO:2814, SEQ ID NO:2816, SEQ ID NO:2818, SEQ ID NO:2820,
SEQ ID NO:2822, SEQ ID NO:2824, SEQ ID NO:2826, SEQ ID NO:2828, SEQ ID NO:2830,
SEQ ID NO:2832, SEQ ID NO:2834, SEQ ID NO:2836, SEQ ID NO:2838, SEQ ID NO:2840,
SEQ ID NO:2842, SEQ ID NO:2844, SEQ ID NO:2846, SEQ ID NO:2848, SEQ ID NO:2850,
SEQ ID NO:2852, SEQ ID NO:2854, SEQ ID NO:2856, SEQ ID NO:2858, SEQ ID NO:2860,
SEQ ID NO:2862, SEQ ID NO:2864, SEQ ID NO:2866, SEQ ID NO:2868, SEQ ID NO:2870,
SEQ ID NO:2872, SEQ ID NO:2874, SEQ ID NO:2876, SEQ ID NO:2878, SEQ ID NO:2880, 
SEQ ID NO:2882, SEQ ID NO:2884, SEQ ID NO:2886, SEQ ID NO:2888, SEQ ID NO:2890, 
SEQ ID NO:2892, SEQ ID NO:2894, SEQ ID NO:2896, SEQ ID NO:2898, SEQ ID NO:2900,
SEQ ID NO:2902, SEQ ID NO:2904, SEQ ID NO:2906, SEQ ID NO:2908, SEQ ID NO:2910,
SEQ ID NO:2912, SEQ ID NO:2914, SEQ ID NO:2916, SEQ ID NO:2918, SEQ ID NO:2920,
SEQ ID NO:2922, SEQ ID NO:2924, SEQ ID NO:2926, SEQ ID NO:2928, SEQ ID NO:2930,
SEQ ID NO:2932, SEQ ID NO:2934, SEQ ID NO:2936, SEQ ID NO:2938, SEQ ID NO:2940,
SEQ ID NO:2942, SEQ ID NO:2944, SEQ ID NO:2946, SEQ ID NO:2948, SEQ ID NO:2950,
SEQ ID NO:2952, SEQ ID NO:2954, SEQ ID NO:2956, SEQ ID NO:2958, SEQ ID NO:2960,
SEQ ID NO:2962, SEQ ID NO:nnnn , SEQ ID NO:2966, SEQ ID NO:2967, SEQ ID NO:2969,
SEQ ID NO:2971, SEQ ID NO:2973, SEQ ID NO:2975, SEQ ID NO:2977, SEQ ID NO:2979,
SEQ ID NO:2981, SEQ ID NO:2983, SEQ ID NO:2985, SEQ ID NO:2987, SEQ ID NO:2989,
SEQ ID NO:2991, SEQ ID NO:2993, SEQ ID NO:2995, SEQ ID NO:2997, SEQ ID NO:2999, 
SEQ ID NO:3001, SEQ ID NO:3003, SEQ ID NO:3005, SEQ ID NO:3007, SEQ ID NO:3009, 
SEQ ID NO:3011, SEQ ID NO:3013, SEQ ID NO:3015, SEQ ID NO:3017, SEQ ID NO:3019,
SEQ ID NO:3021, SEQ ID NO:3023, SEQ ID NO:3025, SEQ ID NO:3027, SEQ ID NO:3029,
SEQ ID NO:3031, SEQ ID NO:3033, SEQ ID NO:3035, SEQ ID NO:3037, SEQ ID NO:3039, 
SEQ ID NO:nnnn , SEQ ID NO:nnnn , SEQ ID NO:3045, SEQ ID NO:3047, SEQ ID NO:3049, 
SEQ ID NO:3051, SEQ ID NO:3053, SEQ ID NO:3055, SEQ ID NO:3057, SEQ ID NO:3059,
SEQ ID NO:3061, SEQ ID NO:3063, SEQ ID NO:3065, SEQ ID NO:3067, SEQ ID NO:3069,
SEQ ID NO:3071, SEQ ID NO:3073, SEQ ID NO:3075, SEQ ID NO:3077, SEQ ID NO:3079,
SEQ ID NO:3081, SEQ ID NO:3083, SEQ ID NO:3085, SEQ ID NO:3087, SEQ ID NO:3089,
SEQ ID NO:3091, SEQ ID NO:3093, SEQ ID NO:3095, SEQ ID NO:3097, SEQ ID NO:3099,
SEQ ID NO:3101, SEQ ID NO:3103, SEQ ID NO:3105, SEQ ID NO:3107, SEQ ID NO:3109,
SEQ ID NO:3111, SEQ ID NO:3113, SEQ ID NO:3115, SEQ ID NO:3117, SEQ ID NO:3119,
SEQ ID NO:3121, SEQ ID NO:3123, SEQ ID NO:3125, SEQ ID NO:3127, SEQ ID NO:3129,
SEQ ID NO:3131, SEQ ID NO:3133, SEQ ID NO:3135, SEQ ID NO:3137, SEQ ID NO:3139,
SEQ ID NO:3140, SEQ ID NO:3142, SEQ ID NO:3143, SEQ ID NO:3144, SEQ ID NO:3146,
SEQ ID NO:3148, SEQ ID NO:3150, SEQ ID NO:3152, SEQ ID NO:3154, SEQ ID NO:3156,
SEQ ID NO:3158, SEQ ID NO:nnnn , SEQ ID NO:3162, SEQ ID NO:3164, SEQ ID NO:3166, 
SEQ ID NO:3168, SEQ ID NO:3170, SEQ ID NO:3172, SEQ ID NO:3174, SEQ ID NO:3176,
SEQ ID NO:3178, SEQ ID NO:3180, SEQ ID NO:3181, SEQ ID NO:3183, SEQ ID NO:3185,
SEQ ID NO:3187, SEQ ID NO:3188, SEQ ID NO:3189, SEQ ID NO:3191, SEQ ID NO:3193,
SEQ ID NO:3195, SEQ ID NO:3196, SEQ ID NO:3198, SEQ ID NO:3200, SEQ ID NO:3202,
SEQ ID NO:3204, SEQ ID NO:3206, SEQ ID NO:3208, SEQ ID NO:3210, SEQ ID NO:3212,
SEQ ID NO:3214, SEQ ID NO:nnnn , SEQ ID NO:3218, SEQ ID NO:3219, SEQ ID NO:3221,
SEQ ID NO:3222, SEQ ID NO:3224, SEQ ID NO:3226, SEQ ID NO:nnnn , SEQ ID NO:3230,
SEQ ID NO:3232, SEQ ID NO:3234, SEQ ID NO:3236, SEQ ID NO:3238, SEQ ID NO:3240,
SEQ ID NO:3242, SEQ ID NO:3244, SEQ ID NO:3246, SEQ ID NO:nnnn , SEQ ID NO:3250,
SEQ ID NO:3252, SEQ ID NO:3254, SEQ ID NO:3256, SEQ ID NO:3258, SEQ ID NO:3260,
SEQ ID NO:3262, SEQ ID NO:3264, SEQ ID NO:3266, SEQ ID NO:3268, SEQ ID NO:3270, 
SEQ ID NO:3272, SEQ ID NO:3274, SEQ ID NO:3276, SEQ ID NO:3278, SEQ ID NO:3280, 
SEQ ID NO:3282, SEQ ID NO:3284, SEQ ID NO:3286, SEQ ID NO:3288, SEQ ID NO:3290,
SEQ ID NO:3292, SEQ ID NO:3294, SEQ ID NO:3296, SEQ ID NO:3298, SEQ ID NO:3300, 
SEQ ID NO:3301, SEQ ID NO:3303, SEQ ID NO:3305, SEQ ID NO:3307, SEQ ID NO:3309, 
SEQ ID NO:3311, SEQ ID NO:3312, SEQ ID NO:3314, SEQ ID NO:3316, SEQ ID NO:3318,
SEQ ID NO:3320, SEQ ID NO:3322, SEQ ID NO:3324, SEQ ID NO:3326, SEQ ID NO:3328,
SEQ ID NO:3330, SEQ ID NO:3332, SEQ ID NO:3334, SEQ ID NO:3336, SEQ ID NO:3338, 
SEQ ID NO:nnnn , SEQ ID NO:3342, SEQ ID NO:3344, SEQ ID NO:3346, SEQ ID NO:3348, 
SEQ ID NO:3350, SEQ ID NO:3352, SEQ ID NO:3354, SEQ ID NO:3356, SEQ ID NO:3358, 
SEQ ID NO:3360, SEQ ID NO:3362, SEQ ID NO:3364, SEQ ID NO:3366, SEQ ID NO:3368, 
SEQ ID NO:3370, SEQ ID NO:3372, SEQ ID NO:3374, SEQ ID NO:3376, SEQ ID NO:3378, 
SEQ ID NO:3380, SEQ ID NO:3382, SEQ ID NO:3384, SEQ ID NO:3386, SEQ ID NO:3388,
SEQ ID NO:3390, SEQ ID NO:3392, SEQ ID NO:3394, SEQ ID NO:3396, SEQ ID NO:3398,
SEQ ID NO:3400, SEQ ID NO:3402, SEQ ID NO:3404, SEQ ID NO:nnnn , SEQ ID NO:3408,
SEQ ID NO:3410, SEQ ID NO:3411, SEQ ID NO:3413, SEQ ID NO:3415, SEQ ID NO:3417,
SEQ ID NO:3419, SEQ ID NO:nnnn , SEQ ID NO:3423, SEQ ID NO:3425, SEQ ID NO:3427,
SEQ ID NO:3429, SEQ ID NO:3431, SEQ ID NO:3433, SEQ ID NO:3435, SEQ ID NO:3437,
SEQ ID NO:3439, SEQ ID NO:3441, SEQ ID NO:3443, SEQ ID NO:3445, SEQ ID NO:3447,
SEQ ID NO:3449, SEQ ID NO:3451, SEQ ID NO:3453, SEQ ID NO:3455, SEQ ID NO:3457,
SEQ ID NO:3459, SEQ ID NO:3461, SEQ ID NO:3463, SEQ ID NO:3465, SEQ ID NO:3467, 
SEQ ID NO:3469, SEQ ID NO:3471, SEQ ID NO:3473, SEQ ID NO:3475, SEQ ID NO:3477, 
SEQ ID NO:3479, SEQ ID NO:3481, SEQ ID NO:3482, SEQ ID NO:3484, SEQ ID NO:3486,
SEQ ID NO:3488, SEQ ID NO:3490, SEQ ID NO:3492, SEQ ID NO:3494, SEQ ID NO:3496,
SEQ ID NO:3498, SEQ ID NO:3500, SEQ ID NO:3502, SEQ ID NO:3504, SEQ ID NO:3506,
SEQ ID NO:3508, SEQ ID NO:3509, SEQ ID NO:3511, SEQ ID NO:3513, SEQ ID NO:3515,
SEQ ID NO:3517, SEQ ID NO:3519, SEQ ID NO:3521, SEQ ID NO:3523, SEQ ID NO:3525, 
SEQ ID NO:3527, SEQ ID NO:3529, SEQ ID NO:3531, SEQ ID NO:3533, SEQ ID NO:3535, 
SEQ ID NO:3537, SEQ ID NO:3539, SEQ ID NO:3541, SEQ ID NO:3543, SEQ ID NO:3545, 
SEQ ID NO:3547, SEQ ID NO:3549, SEQ ID NO:3551, SEQ ID NO:3553, SEQ ID NO:3555, 
SEQ ID NO:3557, SEQ ID NO:3559, SEQ ID NO:3561, SEQ ID NO:3563, SEQ ID NO:nnnn , 
SEQ ID NO:3567, SEQ ID NO:3569, SEQ ID NO:3571, SEQ ID NO:3573, SEQ ID NO:3575,
SEQ ID NO:3577, SEQ ID NO:3579, SEQ ID NO:3581, SEQ ID NO:3583, SEQ ID NO:3585,
SEQ ID NO:3587, SEQ ID NO:3589, SEQ ID NO:3595, SEQ ID NO:3597, SEQ ID NO:3599,
SEQ ID NO:3601, SEQ ID NO:3603, SEQ ID NO:3605, SEQ ID NO:3607, SEQ ID NO:3609,
SEQ ID NO:3610, SEQ ID NO:3611, SEQ ID NO:3612, SEQ ID NO:3613, SEQ ID NO:3615,
SEQ ID NO:3617, SEQ ID NO:3619, SEQ ID NO:3621, SEQ ID NO:3622, SEQ ID NO:3624, 
SEQ ID NO:3626, SEQ ID NO:3627, SEQ ID NO:3628, SEQ ID NO:3629, SEQ ID NO:3631, 
SEQ ID NO:3633, SEQ ID NO:3634, SEQ ID NO:3636, SEQ ID NO:3638, SEQ ID NO:3640, 
SEQ ID NO:3642, SEQ ID NO:3644, SEQ ID NO:3645, SEQ ID NO:3647, SEQ ID NO:3649,
SEQ ID NO:3651, SEQ ID NO:3653, SEQ ID NO:3655, SEQ ID NO:3657, SEQ ID NO:3658,
SEQ ID NO:3660, SEQ ID NO:3662, SEQ ID NO:3664, SEQ ID NO:3666, SEQ ID NO:3668,
SEQ ID NO:3670, SEQ ID NO:3672, SEQ ID NO:3674, SEQ ID NO:3676, SEQ ID NO:3678,
SEQ ID NO:3680, SEQ ID NO:3682, SEQ ID NO:3683, SEQ ID NO:3685, SEQ ID NO:3686,
SEQ ID NO:3688, SEQ ID NO:3690, SEQ ID NO:3692, SEQ ID NO:3694, SEQ ID NO:3696,
SEQ ID NO:3698, SEQ ID NO:3700, SEQ ID NO:3702, SEQ ID NO:3704, SEQ ID NO:3706,
SEQ ID NO:3708, SEQ ID NO:3710, SEQ ID NO:3712, SEQ ID NO:3714, SEQ ID NO:3716,
SEQ ID NO:3718, SEQ ID NO:3720, SEQ ID NO:3722, SEQ ID NO:3724, SEQ ID NO:3726,
SEQ ID NO:3728, SEQ ID NO:3730, SEQ ID NO:3732, SEQ ID NO:3734, SEQ ID NO:3736,
SEQ ID NO:3738, SEQ ID NO:3740, SEQ ID NO:3741, SEQ ID NO:3743, SEQ ID NO:3745,
SEQ ID NO:3747, SEQ ID NO:3749, SEQ ID NO:3751, SEQ ID NO:3753, SEQ ID NO:3755,
SEQ ID NO:3756, SEQ ID NO:3758, SEQ ID NO:3760, SEQ ID NO:3761, SEQ ID NO:3763,
SEQ ID NO:3765, SEQ ID NO:3767, SEQ ID NO:3769, SEQ ID NO:3771, SEQ ID NO:3773,
SEQ ID NO:3775, SEQ ID NO:3777,
SEQ ID NO:3779, SEQ ID NO:3781, SEQ ID NO:3783, SEQ ID NO:3785, SEQ ID NO:3787, 
SEQ ID NO:3789, SEQ ID NO:3791, SEQ ID NO:3793, SEQ ID NO:3795, SEQ ID NO:3797, 
SEQ ID NO:3799, SEQ ID NO:3801, SEQ ID NO:3803, SEQ ID NO:3805, SEQ ID NO:3806, 
SEQ ID NO:3808, SEQ ID NO:3810, SEQ ID NO:3812, SEQ ID NO:3814, SEQ ID NO:3816,
SEQ ID NO:3818, SEQ ID NO:3820, SEQ ID NO:3822, SEQ ID NO:3824, SEQ ID NO:3826, 
SEQ ID NO:3828, SEQ ID NO:3830, SEQ ID NO:3832, SEQ ID NO:3834, SEQ ID NO:3836, 
SEQ ID NO:3838, SEQ ID NO:3840, SEQ ID NO:3842, SEQ ID NO:3844, SEQ ID NO:3846, 
SEQ ID NO:3848, SEQ ID NO:3852, SEQ ID NO:3854, SEQ ID NO:3856, SEQ ID NO:3858, 
SEQ ID NO:3860, SEQ ID NO:3862, SEQ ID NO:3864, SEQ ID NO:3866, SEQ ID NO:3868, 
SEQ ID NO:3870, SEQ ID NO:3872, SEQ ID NO:3874, SEQ ID NO:3876, SEQ ID NO:3878, 
SEQ ID NO:3880, SEQ ID NO:3882, SEQ ID NO:3884, SEQ ID NO:3886, SEQ ID NO:3888, 
SEQ ID NO:3890, SEQ ID NO:3892, SEQ ID NO:3894, SEQ ID NO:3896, SEQ ID NO:3898, 
SEQ ID NO:3900, SEQ ID NO:3902, SEQ ID NO:3904, SEQ ID NO:3906, SEQ ID NO:3908,
SEQ ID NO:3910, SEQ ID NO:3912, SEQ ID NO:3914, SEQ ID NO:3916, SEQ ID NO:3918, 
SEQ ID NO:3920, SEQ ID NO:3922, SEQ ID NO:3924, SEQ ID NO:3926, SEQ ID NO:3928, 
SEQ ID NO:3930, SEQ ID NO:3932, SEQ ID NO:3934, SEQ ID NO:3936, SEQ ID NO:3938, 
SEQ ID NO:3940, SEQ ID NO:3942, SEQ ID NO:3944, SEQ ID NO:3946, SEQ ID NO:3948, 
SEQ ID NO:3950, SEQ ID NO:3952, SEQ ID NO:3954, SEQ ID NO:3956, SEQ ID NO:3958, 
SEQ ID NO:3960, SEQ ID NO:3962, SEQ ID NO:3964, SEQ ID NO:3965, SEQ ID NO:3966, 
SEQ ID NO:3968, SEQ ID NO:3970, SEQ ID NO:3972, SEQ ID NO:3974, SEQ ID NO:3976,
SEQ ID NO:3977, SEQ ID NO:3979, SEQ ID NO:3981, SEQ ID NO:3983, SEQ ID NO:3985, 
SEQ ID NO:3987, SEQ ID NO:3988, SEQ ID NO:3990, SEQ ID NO:3992, SEQ ID NO:3994, 
SEQ ID NO:3996, SEQ ID NO:3998, SEQ ID NO:4000, SEQ ID NO:4001, SEQ ID NO:4003,
SEQ ID NO:4004, SEQ ID NO:4006, SEQ ID NO:4008, SEQ ID NO:4010, SEQ ID NO:4012,
SEQ ID NO:4016, SEQ ID NO:4018, SEQ ID NO:4020, SEQ ID NO:4022, SEQ ID NO:4024,
SEQ ID NO:4026, SEQ ID NO:4028, SEQ ID NO:4030, SEQ ID NO:4032, SEQ ID NO:4034,
SEQ ID NO:4036, SEQ ID NO:4038, SEQ ID NO:4040, SEQ ID NO:4042, SEQ ID NO:4044,
SEQ ID NO:4046, SEQ ID NO:4048, SEQ ID NO:4050, SEQ ID NO:4052, SEQ ID NO:4054,
SEQ ID NO:4056, SEQ ID NO:4058, SEQ ID NO:4060, SEQ ID NO:4062, SEQ ID NO:4064,
SEQ ID NO:4066, SEQ ID NO:4068, SEQ ID NO:4070, SEQ ID NO:4072, SEQ ID NO:4074,
SEQ ID NO:4076,
SEQ ID NO:4078, SEQ ID NO:4080, SEQ ID NO:4082, SEQ ID NO:4084, SEQ ID NO:4086,
SEQ ID NO:4088, SEQ ID NO:4090, SEQ ID NO:4092, SEQ ID NO:4094, SEQ ID NO:4096,
SEQ ID NO:4098, SEQ ID NO:4099, SEQ ID NO:4101, SEQ ID NO:4103, SEQ ID NO:4105,
SEQ ID NO:4107, SEQ ID NO:4109, SEQ ID NO:4111, SEQ ID NO:4113, SEQ ID NO:4115,
SEQ ID NO:4117, SEQ ID NO:4119, SEQ ID NO:4121, SEQ ID NO:4123, SEQ ID NO:4125,
SEQ ID NO:4127, SEQ ID NO:4129, SEQ ID NO:4131, SEQ ID NO:4132, SEQ ID NO:4134, 
SEQ ID NO:4136, SEQ ID NO:4137, SEQ ID NO:4139, SEQ ID NO:4140, SEQ ID NO:4142, 
SEQ ID NO:4144, SEQ ID NO:4146, SEQ ID NO:4148, SEQ ID NO:4150, SEQ ID NO:4152,
SEQ ID NO:4154, SEQ ID NO:4155, SEQ ID NO:4157, SEQ ID NO:4159, SEQ ID NO:4161,
SEQ ID NO:4163, SEQ ID NO:4165,
SEQ ID NO:4167, SEQ ID NO:4168, SEQ ID NO:4170, SEQ ID NO:4172, SEQ ID NO:4174,
SEQ ID NO:4176, and SEQ ID NO:4178;

(b) a polynucleotide comprising a nucleotide sequence encoding a protein comprising the amino acid sequence 
set forth in any one of the following SEQ ID NOs:

SEQ ID NO:2548, SEQ ID NO:nnnn , SEQ ID NO:2552, SEQ ID NO:2554, SEQ ID NO:2556,
SEQ ID NO:2559, SEQ ID NO:2561, SEQ ID NO:2563, SEQ ID NO:2565, SEQ ID NO:2567,
SEQ ID NO:2569, SEQ ID NO:2571, SEQ ID NO:2573, SEQ ID NO:2575, SEQ ID NO:2577,
SEQ ID NO:2579, SEQ ID NO:2582, SEQ ID NO:2585, SEQ ID NO:nnnn , SEQ ID NO:2589,
SEQ ID NO:2591, SEQ ID NO:2593, SEQ ID NO:2595, SEQ ID NO:2597, SEQ ID NO:2599,
SEQ ID NO:nnnn , SEQ ID NO:2603, SEQ ID NO:2605, SEQ ID NO:2608, SEQ ID NO:2610,
SEQ ID NO:2612, SEQ ID NO:2614, SEQ ID NO:2616, SEQ ID NO:2618, SEQ ID NO:2620, 
SEQ ID NO:2622, SEQ ID NO:2624, SEQ ID NO:2626, SEQ ID NO:2628, SEQ ID NO:2631,
SEQ ID NO:2633, SEQ ID NO:2635, SEQ ID NO:2637, SEQ ID NO:2639, SEQ ID NO:2641,
SEQ ID NO:2643, SEQ ID NO:2645, SEQ ID NO:2647, SEQ ID NO:2649, SEQ ID NO:2651,
SEQ ID NO:2653, SEQ ID NO:2655, SEQ ID NO:2657, SEQ ID NO:2659, SEQ ID NO:2661,
SEQ ID NO:2663, SEQ ID NO:2665, SEQ ID NO:2667, SEQ ID NO:2669, SEQ ID NO:2671,
SEQ ID NO:2673, SEQ ID NO:2675, SEQ ID NO:2677, SEQ ID NO:2679, SEQ ID NO:2682, 
SEQ ID NO:2684, SEQ ID NO:2687, SEQ ID NO:2689, SEQ ID NO:2691, SEQ ID NO:2693,
SEQ ID NO:2695, SEQ ID NO:2697, SEQ ID NO:2699, SEQ ID NO:2701, SEQ ID NO:2703,
SEQ ID NO:2705, SEQ ID NO:2707, SEQ ID NO:2709, SEQ ID NO:2711, SEQ ID NO:2713,
SEQ ID NO:2716, SEQ ID NO:2720, SEQ ID NO:2723, SEQ ID NO:2725, SEQ ID NO:2727,
SEQ ID NO:2729, SEQ ID NO:2731, SEQ ID NO:2733, SEQ ID NO:2736, SEQ ID NO:2738,
SEQ ID NO:2740, SEQ ID NO:nnnn , SEQ ID NO:2744, SEQ ID NO:2746, SEQ ID NO:2748,
SEQ ID NO:2751, SEQ ID NO:2753, SEQ ID NO:2755, SEQ ID NO:2757, SEQ ID NO:2759,
SEQ ID NO:2762, SEQ ID NO:nnnn , SEQ ID NO:2767, SEQ ID NO:2769, SEQ ID NO:2771,
SEQ ID NO:2773, SEQ ID NO:2775, SEQ ID NO:2777, SEQ ID NO:2779, SEQ ID NO:2781,
SEQ ID NO:2783, SEQ ID NO:2785, SEQ ID NO:2787, SEQ ID NO:2789, SEQ ID NO:2791,
SEQ ID NO:2793, SEQ ID NO:2795, SEQ ID NO:2797, SEQ ID NO:2799, SEQ ID NO:2801,
SEQ ID NO:2803, SEQ ID NO:2805, SEQ ID NO:2807, SEQ ID NO:2809, SEQ ID NO:2811,
SEQ ID NO:2813, SEQ ID NO:2815, SEQ ID NO:2817, SEQ ID NO:2819, SEQ ID NO:2821,
SEQ ID NO:2823, SEQ ID NO:2825, SEQ ID NO:2827, SEQ ID NO:2829, SEQ ID NO:2831,
SEQ ID NO:2833, SEQ ID NO:2835, SEQ ID NO:2837, SEQ ID NO:2839, SEQ ID NO:2841,
SEQ ID NO:2843, SEQ ID NO:2845, SEQ ID NO:2847, SEQ ID NO:2849, SEQ ID NO:2851, 
SEQ ID NO:2853, SEQ ID NO:2855, SEQ ID NO:2857, SEQ ID NO:2859, SEQ ID NO:2861,
SEQ ID NO:2863, SEQ ID NO:2865, SEQ ID NO:2867, SEQ ID NO:2869, SEQ ID NO:2871,
SEQ ID NO:2873, SEQ ID NO:2875, SEQ ID NO:2877, SEQ ID NO:2879, SEQ ID NO:2881,
SEQ ID NO:2883, SEQ ID NO:2885, SEQ ID NO:2887, SEQ ID NO:2889, SEQ ID NO:2891,
SEQ ID NO:2893, SEQ ID NO:2895, SEQ ID NO:2897, SEQ ID NO:2899, SEQ ID NO:2901,
SEQ ID NO:2903, SEQ ID NO:2905, SEQ ID NO:2907, SEQ ID NO:2909, SEQ ID NO:2911,
SEQ ID NO:2913, SEQ ID NO:2915, SEQ ID NO:2917, SEQ ID NO:2919, SEQ ID NO:2921, 
SEQ ID NO:2923, SEQ ID NO:2925, SEQ ID NO:2927, SEQ ID NO:2929, SEQ ID NO:2931,
SEQ ID NO:2933, SEQ ID NO:2935, SEQ ID NO:2937, SEQ ID NO:2939, SEQ ID NO:2941,
SEQ ID NO:2943, SEQ ID NO:2945, SEQ ID NO:2947, SEQ ID NO:2949, SEQ ID NO:2951,
SEQ ID NO:2953, SEQ ID NO:2955, SEQ ID NO:2957, SEQ ID NO:2959, SEQ ID NO:2961,
SEQ ID NO:2963, SEQ ID NO:nnnn , SEQ ID NO:2968, SEQ ID NO:2970, SEQ ID NO:2972,
SEQ ID NO:2974, SEQ ID NO:2976, SEQ ID NO:2978, SEQ ID NO:2980, SEQ ID NO:2982,
SEQ ID NO:2984, SEQ ID NO:2986, SEQ ID NO:2988, SEQ ID NO:2990, SEQ ID NO:2992,
SEQ ID NO:2994, SEQ ID NO:2996, SEQ ID NO:2998, SEQ ID NO:3000, SEQ ID NO:3002,
SEQ ID NO:3004, SEQ ID NO:3006, SEQ ID NO:3008, SEQ ID NO:3010, SEQ ID NO:3012,
SEQ ID NO:3014, SEQ ID NO:3016, SEQ ID NO:3018, SEQ ID NO:3020, SEQ ID NO:3022,
SEQ ID NO:3024, SEQ ID NO:3026, SEQ ID NO:3028, SEQ ID NO:3030, SEQ ID NO:3032,
SEQ ID NO:3034, SEQ ID NO:3036, SEQ ID NO:3038, SEQ ID NO:3040, SEQ ID NO:nnnn ,
SEQ ID NO:nnnn , SEQ ID NO:3046, SEQ ID NO:3048, SEQ ID NO:3050, SEQ ID NO:3052, 
SEQ ID NO:3054, SEQ ID NO:3056, SEQ ID NO:3058, SEQ ID NO:3060, SEQ ID NO:3062,
SEQ ID NO:3064, SEQ ID NO:3066, SEQ ID NO:3068, SEQ ID NO:3070, SEQ ID NO:3072,
SEQ ID NO:3074, SEQ ID NO:3076, SEQ ID NO:3078, SEQ ID NO:3080, SEQ ID NO:3082,
SEQ ID NO:3084, SEQ ID NO:3086, SEQ ID NO:3088, SEQ ID NO:3090, SEQ ID NO:3092,
SEQ ID NO:3094, SEQ ID NO:3096, SEQ ID NO:3098, SEQ ID NO:3100, SEQ ID NO:3102,
SEQ ID NO:3104, SEQ ID NO:3106, SEQ ID NO:3108, SEQ ID NO:3110, SEQ ID NO:3112,
SEQ ID NO:3114, SEQ ID NO:3116, SEQ ID NO:3118, SEQ ID NO:3120, SEQ ID NO:3122,
SEQ ID NO:3124, SEQ ID NO:3126, SEQ ID NO:3128, SEQ ID NO:3130, SEQ ID NO:3132, 
SEQ ID NO:3134, SEQ ID NO:3136, SEQ ID NO:3138, SEQ ID NO:3141, SEQ ID NO:3145, 
SEQ ID NO:3147, SEQ ID NO:3149, SEQ ID NO:3151, SEQ ID NO:3153, SEQ ID NO:3155, 
SEQ ID NO:3157, SEQ ID NO:3159, SEQ ID NO:nnnn , SEQ ID NO:3163, SEQ ID NO:3165,
SEQ ID NO:3167, SEQ ID NO:3169, SEQ ID NO:3171, SEQ ID NO:3173, SEQ ID NO:3175,
SEQ ID NO:3177, SEQ ID NO:3179, SEQ ID NO:3182, SEQ ID NO:3184, SEQ ID NO:3186,
SEQ ID NO:3190, SEQ ID NO:3192, SEQ ID NO:3194, SEQ ID NO:3197, SEQ ID NO:3199,
SEQ ID NO:3201, SEQ ID NO:3203, SEQ ID NO:3205, SEQ ID NO:3207, SEQ ID NO:3209,
SEQ ID NO:3211, SEQ ID NO:3213, SEQ ID NO:3215, SEQ ID NO:nnnn , SEQ ID NO:3220,
SEQ ID NO:3223, SEQ ID NO:3225, SEQ ID NO:3227, SEQ ID NO:nnnn , SEQ ID NO:3231,
SEQ ID NO:3233, SEQ ID NO:3235, SEQ ID NO:3237, SEQ ID NO:3239, SEQ ID NO:3241,
SEQ ID NO:3243, SEQ ID NO:3245, SEQ ID NO:3247, SEQ ID NO:nnnn , SEQ ID NO:3251,
SEQ ID NO:3253, SEQ ID NO:3255, SEQ ID NO:3257, SEQ ID NO:3259, SEQ ID NO:3261,
SEQ ID NO:3263, SEQ ID NO:3265, SEQ ID NO:3267, SEQ ID NO:3269, SEQ ID NO:3271, 
SEQ ID NO:3273, SEQ ID NO:3275, SEQ ID NO:3277, SEQ ID NO:3279, SEQ ID NO:3281, 
SEQ ID NO:3283, SEQ ID NO:3285, SEQ ID NO:3287, SEQ ID NO:3289, SEQ ID NO:3291, 
SEQ ID NO:3293, SEQ ID NO:3295, SEQ ID NO:3297, SEQ ID NO:3299, SEQ ID NO:3302, 
SEQ ID NO:3304, SEQ ID NO:3306, SEQ ID NO:3308, SEQ ID NO:3310, SEQ ID NO:3313, 
SEQ ID NO:3315, SEQ ID NO:3317, SEQ ID NO:3319, SEQ ID NO:3321, SEQ ID NO:3323, 
SEQ ID NO:3325, SEQ ID NO:3327, SEQ ID NO:3329, SEQ ID NO:3331, SEQ ID NO:3333, 
SEQ ID NO:3335, SEQ ID NO:3337, SEQ ID NO:3339, SEQ ID NO:nnnn , SEQ ID NO:3343,
SEQ ID NO:3345, SEQ ID NO:3347, SEQ ID NO:3349, SEQ ID NO:3351, SEQ ID NO:3353, 
SEQ ID NO:3355, SEQ ID NO:3357, SEQ ID NO:3359, SEQ ID NO:3361, SEQ ID NO:3363, 
SEQ ID NO:3365, SEQ ID NO:3367, SEQ ID NO:3369, SEQ ID NO:3371, SEQ ID NO:3373,
SEQ ID NO:3375, SEQ ID NO:3377, SEQ ID NO:3379, SEQ ID NO:3381, SEQ ID NO:3383, 
SEQ ID NO:3385, SEQ ID NO:3387, SEQ ID NO:3389, SEQ ID NO:3391, SEQ ID NO:3393, 
SEQ ID NO:3395, SEQ ID NO:3397, SEQ ID NO:3399, SEQ ID NO:3401, SEQ ID NO:3403, 
SEQ ID NO:3405, SEQ ID NO:nnnn , SEQ ID NO:3409, SEQ ID NO:3412, SEQ ID NO:3414,
SEQ ID NO:3416, SEQ ID NO:3418, SEQ ID NO:3420, SEQ ID NO:nnnn , SEQ ID NO:3424, 
SEQ ID NO:3426, SEQ ID NO:3428, SEQ ID NO:3430, SEQ ID NO:3432, SEQ ID NO:3434, 
SEQ ID NO:3436, SEQ ID NO:3438, SEQ ID NO:3440, SEQ ID NO:3442, SEQ ID NO:3444, 
SEQ ID NO:3446, SEQ ID NO:3448, SEQ ID NO:3450, SEQ ID NO:3452, SEQ ID NO:3454, 
SEQ ID NO:3456, SEQ ID NO:3458, SEQ ID NO:3460, SEQ ID NO:3462, SEQ ID NO:3464, 
SEQ ID NO:3466, SEQ ID NO:3468, SEQ ID NO:3470, SEQ ID NO:3472, SEQ ID NO:3474, 
SEQ ID NO:3476, SEQ ID NO:3478, SEQ ID NO:3480, SEQ ID NO:3483, SEQ ID NO:3485, 
SEQ ID NO:3487, SEQ ID NO:3489, SEQ ID NO:3491, SEQ ID NO:3493, SEQ ID NO:3495, 
SEQ ID NO:3497, SEQ ID NO:3499, SEQ ID NO:3501, SEQ ID NO:3503, SEQ ID NO:3505, 
SEQ ID NO:3507, SEQ ID NO:3510, SEQ ID NO:3512, SEQ ID NO:3514, SEQ ID NO:3516, 
SEQ ID NO:3518, SEQ ID NO:3520, SEQ ID NO:3522, SEQ ID NO:3524, SEQ ID NO:3526, 
SEQ ID NO:3528, SEQ ID NO:3530, SEQ ID NO:3532, SEQ ID NO:3534, SEQ ID NO:3536, 
SEQ ID NO:3538, SEQ ID NO:3540, SEQ ID NO:3542, SEQ ID NO:3544, SEQ ID NO:3546,
SEQ ID NO:3548, SEQ ID NO:3550, SEQ ID NO:3552, SEQ ID NO:3554, SEQ ID NO:3556,
SEQ ID NO:3558, SEQ ID NO:3560, SEQ ID NO:3562, SEQ ID NO:3564, SEQ ID NO:nnnn ,
SEQ ID NO:3568, SEQ ID NO:3570, SEQ ID NO:3572, SEQ ID NO:3574, SEQ ID NO:3576,
SEQ ID NO:3578, SEQ ID NO:3580, SEQ ID NO:3582, SEQ ID NO:3584, SEQ ID NO:3586,
SEQ ID NO:3588, SEQ ID NO:3590,
SEQ ID NO:3596, SEQ ID NO:3598, SEQ ID NO:3600, SEQ ID NO:3602, SEQ ID NO:3604,
SEQ ID NO:3606, SEQ ID NO:3608, SEQ ID NO:3614, SEQ ID NO:3616, SEQ ID NO:3618, 
SEQ ID NO:3620, SEQ ID NO:3623, SEQ ID NO:3625, SEQ ID NO:3630, SEQ ID NO:3632,
SEQ ID NO:3635, SEQ ID NO:3637, SEQ ID NO:3639, SEQ ID NO:3641, SEQ ID NO:3643,
SEQ ID NO:3646, SEQ ID NO:3648, SEQ ID NO:3650, SEQ ID NO:3652, SEQ ID NO:3654,
SEQ ID NO:3656, SEQ ID NO:3659, SEQ ID NO:3661, SEQ ID NO:3663, SEQ ID NO:3665,
SEQ ID NO:3667, SEQ ID NO:3669, SEQ ID NO:3671, SEQ ID NO:3673, SEQ ID NO:3675,
SEQ ID NO:3677, SEQ ID NO:3679, SEQ ID NO:3681, SEQ ID NO:3684, SEQ ID NO:3687,
SEQ ID NO:3689, SEQ ID NO:3691, SEQ ID NO:3693, SEQ ID NO:3695, SEQ ID NO:3697,
SEQ ID NO:3699, SEQ ID NO:3701, SEQ ID NO:3703, SEQ ID NO:3705, SEQ ID NO:3707,
SEQ ID NO:3709, SEQ ID NO:3711, SEQ ID NO:3713, SEQ ID NO:3715, SEQ ID NO:3717,
SEQ ID NO:3719, SEQ ID NO:3721, SEQ ID NO:3723, SEQ ID NO:3725, SEQ ID NO:3727,
SEQ ID NO:3729, SEQ ID NO:3731, SEQ ID NO:3733, SEQ ID NO:3735, SEQ ID NO:3737,
SEQ ID NO:3739, SEQ ID NO:3742, SEQ ID NO:3744, SEQ ID NO:3746, SEQ ID NO:3748,
SEQ ID NO:3750, SEQ ID NO:3752, SEQ ID NO:3754, SEQ ID NO:3757, SEQ ID NO:3759,
SEQ ID NO:3762, SEQ ID NO:3764, SEQ ID NO:3766, SEQ ID NO:3768, SEQ ID NO:3770,
SEQ ID NO:3772, SEQ ID NO:3774, SEQ ID NO:3776, SEQ ID NO:3778, SEQ ID NO:3780, 
SEQ ID NO:3782, SEQ ID NO:3784, SEQ ID NO:3786, SEQ ID NO:3788, SEQ ID NO:3790, 
SEQ ID NO:3792, SEQ ID NO:3794, SEQ ID NO:3796, SEQ ID NO:3798, SEQ ID NO:3800,
SEQ ID NO:3802, SEQ ID NO:3804, SEQ ID NO:3807, SEQ ID NO:3809, SEQ ID NO:3811,
SEQ ID NO:3813, SEQ ID NO:3815, SEQ ID NO:3817, SEQ ID NO:3819, SEQ ID NO:3821,
SEQ ID NO:3823, SEQ ID NO:3825, SEQ ID NO:3827, SEQ ID NO:3829, SEQ ID NO:3831,
SEQ ID NO:3833, SEQ ID NO:3835, SEQ ID NO:3837, SEQ ID NO:3839, SEQ ID NO:3841,
SEQ ID NO:3843, SEQ ID NO:3845, SEQ ID NO:3847, SEQ ID NO:3849, SEQ ID NO:3853,
SEQ ID NO:3855, SEQ ID NO:3857, SEQ ID NO:3859, SEQ ID NO:3861, SEQ ID NO:3863,
SEQ ID NO:3865, SEQ ID NO:3867, SEQ ID NO:3869, SEQ ID NO:3871, SEQ ID NO:3873,
SEQ ID NO:3875, SEQ ID NO:3877, SEQ ID NO:3879, SEQ ID NO:3881, SEQ ID NO:3883,
SEQ ID NO:3885, SEQ ID NO:3887, SEQ ID NO:3889, SEQ ID NO:3891, SEQ ID NO:3893,
SEQ ID NO:3895, SEQ ID NO:3897, SEQ ID NO:3899, SEQ ID NO:3901, SEQ ID NO:3903,
SEQ ID NO:3905, SEQ ID NO:3907, SEQ ID NO:3909, SEQ ID NO:3911, SEQ ID NO:3913,
SEQ ID NO:3915, SEQ ID NO:3917, SEQ ID NO:3919, SEQ ID NO:3921, SEQ ID NO:3923,
SEQ ID NO:3925, SEQ ID NO:3927, SEQ ID NO:3929, SEQ ID NO:3931, SEQ ID NO:3933,
SEQ ID NO:3935, SEQ ID NO:3937, SEQ ID NO:3939, SEQ ID NO:3941, SEQ ID NO:3943,
SEQ ID NO:3945, SEQ ID NO:3947, SEQ ID NO:3949, SEQ ID NO:3951, SEQ ID NO:3953,
SEQ ID NO:3955, SEQ ID NO:3957, SEQ ID NO:3959, SEQ ID NO:3961, SEQ ID NO:3963,
SEQ ID NO:3967, SEQ ID NO:3969, SEQ ID NO:3971, SEQ ID NO:3973, SEQ ID NO:3975,
SEQ ID NO:3978, SEQ ID NO:3980, SEQ ID NO:3982, SEQ ID NO:3984, SEQ ID NO:3986,
SEQ ID NO:3989, SEQ ID NO:3991, SEQ ID NO:3993, SEQ ID NO:3995, SEQ ID NO:3997,
SEQ ID NO:3999, SEQ ID NO:4002, SEQ ID NO:4005, SEQ ID NO:4007, SEQ ID NO:4009,
SEQ ID NO:4011, SEQ ID NO:4013, SEQ ID NO:4017, SEQ ID NO:4019, SEQ ID NO:4021,
SEQ ID NO:4023, SEQ ID NO:4025, SEQ ID NO:4027, SEQ ID NO:4029, SEQ ID NO:4031,
SEQ ID NO:4033, SEQ ID NO:4035, SEQ ID NO:4037, SEQ ID NO:4039, SEQ ID NO:4041,
SEQ ID NO:4043, SEQ ID NO:4045, SEQ ID NO:4047, SEQ ID NO:4049, SEQ ID NO:4051,
SEQ ID NO:4053, SEQ ID NO:4055, SEQ ID NO:4057, SEQ ID NO:4059, SEQ ID NO:4061,
SEQ ID NO:4063, SEQ ID NO:4065, SEQ ID NO:4067, SEQ ID NO:4069, SEQ ID NO:4071,
SEQ ID NO:4073, SEQ ID NO:4075, SEQ ID NO:4077,
SEQ ID NO:4079, SEQ ID NO:4081, SEQ ID NO:4083, SEQ ID NO:4085, SEQ ID NO:4087, 
SEQ ID NO:4089, SEQ ID NO:4091, SEQ ID NO:4093, SEQ ID NO:4095, SEQ ID NO:4097, 
SEQ ID NO:4100, SEQ ID NO:4102, SEQ ID NO:4104, SEQ ID NO:4106, SEQ ID NO:4108,
SEQ ID NO:4110, SEQ ID NO:4112, SEQ ID NO:4114, SEQ ID NO:4116, SEQ ID NO:4118,
SEQ ID NO:4120, SEQ ID NO:4122, SEQ ID NO:4124, SEQ ID NO:4126, SEQ ID NO:4128,
SEQ ID NO:4130, SEQ ID NO:4133, SEQ ID NO:4135, SEQ ID NO:4138, SEQ ID NO:4141,
SEQ ID NO:4143, SEQ ID NO:4145, SEQ ID NO:4147, SEQ ID NO:4149, SEQ ID NO:4151,
SEQ ID NO:4153, SEQ ID NO:4156, SEQ ID NO:4158, SEQ ID NO:4160, SEQ ID NO:4162,
SEQ ID NO:4164, SEQ ID NO:4166,
SEQ ID NO:4169, SEQ ID NO:4171, SEQ ID NO:4173, SEQ ID NO:4175, SEQ ID NO:4177,
and SEQ ID NO:4179;

(c) a polynucleotide comprising a nucleotide sequence encoding a protein comprising an amino acid sequence 
selected from the amino acid sequences of (b), in which one or more amino acids are substituted, deleted, 
inserted, and/or added, wherein said protein is functionally equivalent to the protein comprising said amino
acid sequence selected from the amino acid sequences of (b);

(d) a polynucleotide that hybridizes with a polynucleotide comprising a nucleotide sequence selected from the 
nucleotide sequences of (a), and that comprises a nucleotide sequence encoding a protein functionally equiv- 
alent to the protein encoded by the nucleotide sequence selected from the nucleotide sequences of (a); 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a protein
encoded by the polynucleotide of (a) to (d);

(f) a polynucleotide comprising a nucleotide sequence with at least 70% identity to the nucleotide sequence 
of (a). 

9. A substantially pure protein encoded by the polynucleotide of claim 8. 

10. An antibody against the protein or peptide of any one of claims 6, 7, and 9. 

11. A vector comprising the polynucleotide of claim 5 or 8. 

12. A transformant carrying the polynucleotide of claim 5 or 8, or the vector of claim 11.

13. A transformant expressively carrying the polynucleotide of claim 5 or 8, or the vector of claim 11.

14. A method for producing the protein or peptide of any one of claims 6, 7, and 9, comprising culturing the transformant
of claim 13 and recovering the expression product.

15. An oligonucleotide comprising the nucleotide sequence of claim 8 (a) or the nucleotide sequence complementary 
to the complementary strand thereof, wherein said oligonucleotide comprises 15 nucleotides or more. 

16. Use of the oligonucleotide of claim 15 as a primer for synthesizing a polynucleotide.

17. Use of the oligonucleotide of claim 15 as a probe for detecting a gene. 

18. An antisense polynucleotide against the polynucleotide of claim 8, or a portion thereof.

19. A method for synthesizing a polynucleotide, the method comprising: 

a) synthesizing a complementary strand using a cDNA library as a template, and using the primer set of claim
2 or 3, or the primer of claim 16; and
b) recovering the synthesized product.

20. The method of claim 19, wherein the cDNA library is obtainable by the oligo-capping method.

21. The method of claim 19, wherein the complementary strand is obtainable by PCR. 

22. A method for detecting the polynucleotide of claim 8, the method comprising: 

a) incubating a target polynucleotide with the oligonucleotide of claim 15 under conditions where hybridization
occurs, and

b) detecting the hybridization of the target polynucleotide with the oligonucleotide of claim 15.

23. A database of polynucleotides and/or proteins, the database comprising information on at least one sequence 
selected from the nucleotide sequences of claim 8 (a) and/or the amino acid sequences of claim 8 (b), or a medium 
on which the database is stored.

Figure 1
Figure 2